• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过国家临床队列协作中的电子健康记录进行因果推断:长期新冠研究中的挑战与解决方案

Causal Inference via Electronic Health Records in the National Clinical Cohort Collaborative: Challenges and Solutions in Long COVID Research.

作者信息

Butzin-Dozier Zachary, Ji Yunwen, Wang Lin-Chiun, Anzalone A Jerrod, Hurwitz Eric, Patel Rena C, van der Laan Mark J, Colford John M, Hubbard Alan E

机构信息

School of Public Health, University of California, Berkeley, Berkeley, CA USA.

University of Nebraska Medical Center, Omaha, NE USA.

出版信息

medRxiv. 2025 Jun 11:2025.06.06.25329168. doi: 10.1101/2025.06.06.25329168.

DOI:10.1101/2025.06.06.25329168
PMID:40502605
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12155030/
Abstract

Observational analyses of electronic health record (EHR) data using databases such as the National Clinical Cohort Collaborative include unique challenges for researchers seeking causal inferences, particularly when evaluating subjectively-defined outcomes like Long COVID. We explore several challenges and describe potential solutions. 1. Lack of true negatives: Many diagnoses and conditions either have a positive indicator or a missing status, requiring investigators to carefully consider which patients are likely negative for this condition. 2. Differential monitoring: EHR data include nonrandom missingness driven by patients engaging with the healthcare system at different rates, which is often related to both the exposure and outcome of interest. 3. Bias: EHR data sources face many biases, but are particularly vulnerable to informative missingness, differential monitoring, and model misspecification. 4. Large sample size: High precision (i.e., narrow confidence intervals) paired with potential bias leads to a high risk of incorrectly rejecting the null hypothesis. 5. Defining index time: It is important that investigators deliberately define index time (i.e., , baseline) to ensure that they only adjust for baseline confounders and do not adjust for (or condition on) factors that are affected by the exposure of interest (i.e., colliders or mediators). 6. Parameter selection: Investigators should only select parameters that are supported by the data distribution. This manuscript provides an overview of these challenges and solutions, using both simulated data and real-world data, with the outcome of Long COVID as the running example.

摘要

使用国家临床队列协作等数据库对电子健康记录(EHR)数据进行观察性分析,对于寻求因果推断的研究人员来说存在独特的挑战,尤其是在评估像长期新冠这样主观定义的结果时。我们探讨了几个挑战并描述了潜在的解决方案。1. 缺乏真正的阴性病例:许多诊断和病症要么有阳性指标,要么状态缺失,这就要求研究人员仔细考虑哪些患者可能对此病症呈阴性。2. 差异监测:EHR数据包括由患者以不同速率参与医疗保健系统所驱动的非随机缺失,这通常与感兴趣的暴露因素和结果都有关。3. 偏差:EHR数据源面临许多偏差,但特别容易受到信息性缺失、差异监测和模型错误设定的影响。4. 大样本量:高精度(即窄置信区间)与潜在偏差相结合,导致错误拒绝原假设的风险很高。5. 定义索引时间:研究人员刻意定义索引时间(即,基线)很重要,以确保他们只对基线混杂因素进行调整,而不对受感兴趣的暴露因素影响的因素(即对撞机或中介变量)进行调整(或基于这些因素进行条件设定)。6. 参数选择:研究人员应只选择由数据分布支持的参数。本手稿使用模拟数据和真实世界数据,以长期新冠的结果作为实例,概述了这些挑战和解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09eb/12169158/7236de915db2/nihpp-2025.06.06.25329168v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09eb/12169158/6b7e8eff2e15/nihpp-2025.06.06.25329168v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09eb/12169158/8e2b033603fa/nihpp-2025.06.06.25329168v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09eb/12169158/20db77680069/nihpp-2025.06.06.25329168v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09eb/12169158/7236de915db2/nihpp-2025.06.06.25329168v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09eb/12169158/6b7e8eff2e15/nihpp-2025.06.06.25329168v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09eb/12169158/8e2b033603fa/nihpp-2025.06.06.25329168v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09eb/12169158/20db77680069/nihpp-2025.06.06.25329168v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09eb/12169158/7236de915db2/nihpp-2025.06.06.25329168v2-f0004.jpg

相似文献

1
Causal Inference via Electronic Health Records in the National Clinical Cohort Collaborative: Challenges and Solutions in Long COVID Research.通过国家临床队列协作中的电子健康记录进行因果推断:长期新冠研究中的挑战与解决方案
medRxiv. 2025 Jun 11:2025.06.06.25329168. doi: 10.1101/2025.06.06.25329168.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.
4
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
5
Measures implemented in the school setting to contain the COVID-19 pandemic.学校为控制 COVID-19 疫情而采取的措施。
Cochrane Database Syst Rev. 2022 Jan 17;1(1):CD015029. doi: 10.1002/14651858.CD015029.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作:定性证据综合评价。
Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.
8
Healthcare outcomes assessed with observational study designs compared with those assessed in randomized trials.与随机试验中评估的医疗保健结果相比,观察性研究设计评估的医疗保健结果。
Cochrane Database Syst Rev. 2014 Apr 29;2014(4):MR000034. doi: 10.1002/14651858.MR000034.pub2.
9
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
10
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.

本文引用的文献

1
"Target Trial Emulation" for Observational Studies - Potential and Pitfalls.观察性研究中的“目标试验模拟”——潜力与陷阱
N Engl J Med. 2024 Nov 28;391(21):1975-1977. doi: 10.1056/NEJMp2407586. Epub 2024 Nov 23.
2
SSRI use during acute COVID-19 and risk of long COVID among patients with depression.SSRIs 在急性 COVID-19 期间的使用与抑郁症患者长新冠风险的关系。
BMC Med. 2024 Oct 8;22(1):445. doi: 10.1186/s12916-024-03655-x.
3
Predicting Long COVID in the National COVID Cohort Collaborative Using Super Learner: Cohort Study.
利用超级学习者预测全国 COVID 队列协作中的长新冠:队列研究。
JMIR Public Health Surveill. 2024 Aug 15;10:e53322. doi: 10.2196/53322.
4
A causal roadmap for generating high-quality real-world evidence.生成高质量真实世界证据的因果路线图。
J Clin Transl Sci. 2023 Sep 22;7(1):e212. doi: 10.1017/cts.2023.635. eCollection 2023.
5
Potential Factors Associated with Healthcare Utilization for Balance Problems in Community-Dwelling Adults within the United States: A Narrative Review.美国社区居住成年人平衡问题医疗保健利用的相关潜在因素:一项叙述性综述。
Healthcare (Basel). 2023 Aug 26;11(17):2398. doi: 10.3390/healthcare11172398.
6
Target Trial Emulation to Improve Causal Inference from Observational Data: What, Why, and How?目标试验模拟提高观察性数据因果推断:是什么,为什么,怎么做?
J Am Soc Nephrol. 2023 Aug 1;34(8):1305-1314. doi: 10.1681/ASN.0000000000000152. Epub 2023 May 3.
7
Potential pitfalls in the use of real-world data for studying long COVID.使用真实世界数据研究长期新冠的潜在陷阱。
Nat Med. 2023 May;29(5):1040-1043. doi: 10.1038/s41591-023-02274-y.
8
Practical considerations for specifying a super learner.指定超级学习者的实用考虑因素。
Int J Epidemiol. 2023 Aug 2;52(4):1276-1285. doi: 10.1093/ije/dyad023.
9
Target Trial Emulation: A Framework for Causal Inference From Observational Data.目标试验模拟:一种从观察性数据进行因果推断的框架。
JAMA. 2022 Dec 27;328(24):2446-2447. doi: 10.1001/jama.2022.21383.
10
Learning Causal Effects From Observational Data in Healthcare: A Review and Summary.从医疗保健观察数据中学习因果效应:综述与总结
Front Med (Lausanne). 2022 Jul 7;9:864882. doi: 10.3389/fmed.2022.864882. eCollection 2022.