• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在存在先验概率转移的情况下整合外部摘要信息:在评估原发性高血压中的应用。

Integrating external summary information in the presence of prior probability shift: an application to assessing essential hypertension.

机构信息

Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, 21201, United States.

Department of Neurosurgery, University of Maryland School of Medicine, Baltimore, 21201, United States.

出版信息

Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae090.

DOI:10.1093/biomtc/ujae090
PMID:39248121
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11381951/
Abstract

Recent years have witnessed a rise in the popularity of information integration without sharing of raw data. By leveraging and incorporating summary information from external sources, internal studies can achieve enhanced estimation efficiency and prediction accuracy. However, a noteworthy challenge in utilizing summary-level information is accommodating the inherent heterogeneity across diverse data sources. In this study, we delve into the issue of prior probability shift between two cohorts, wherein the difference of two data distributions depends on the outcome. We introduce a novel semi-parametric constrained optimization-based approach to integrate information within this framework, which has not been extensively explored in existing literature. Our proposed method tackles the prior probability shift by introducing the outcome-dependent selection function and effectively addresses the estimation uncertainty associated with summary information from the external source. Our approach facilitates valid inference even in the absence of a known variance-covariance estimate from the external source. Through extensive simulation studies, we observe the superiority of our method over existing ones, showcasing minimal estimation bias and reduced variance for both binary and continuous outcomes. We further demonstrate the utility of our method through its application in investigating risk factors related to essential hypertension, where the reduced estimation variability is observed after integrating summary information from an external data.

摘要

近年来,信息整合而不共享原始数据的做法越来越流行。通过利用和整合来自外部来源的汇总信息,内部研究可以提高估计效率和预测准确性。然而,利用汇总信息面临的一个挑战是如何适应来自不同数据源的固有异质性。在这项研究中,我们深入研究了两个队列之间的先验概率转移问题,其中两个数据分布的差异取决于结果。我们引入了一种新的半参数约束优化方法来解决这个框架内的信息整合问题,这在现有文献中还没有得到广泛探讨。我们的方法通过引入与结果相关的选择函数来解决先验概率转移问题,并有效地解决了来自外部源的汇总信息的估计不确定性。即使外部源没有已知的方差-协方差估计,我们的方法也能进行有效的推断。通过广泛的模拟研究,我们观察到我们的方法优于现有方法,在二进制和连续结果下,最小化了估计偏差和方差。我们还通过应用于研究与原发性高血压相关的风险因素来展示我们方法的实用性,在整合来自外部数据的汇总信息后,观察到了估计变异性的降低。

相似文献

1
Integrating external summary information in the presence of prior probability shift: an application to assessing essential hypertension.在存在先验概率转移的情况下整合外部摘要信息:在评估原发性高血压中的应用。
Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae090.
2
Efficient data integration under prior probability shift.在先验概率转移下的高效数据集成。
Biometrics. 2024 Mar 27;80(2). doi: 10.1093/biomtc/ujae035.
3
Improving prediction of linear regression models by integrating external information from heterogeneous populations: James-Stein estimators.通过整合来自异质群体的外部信息来改进线性回归模型的预测:詹姆斯-斯廷(James-Stein)估计量。
Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae072.
4
Semiparametric estimation of the transformation model by leveraging external aggregate data in the presence of population heterogeneity.利用群体异质性下外部聚合数据对半参数变换模型进行估计。
Biometrics. 2023 Sep;79(3):1996-2009. doi: 10.1111/biom.13778. Epub 2022 Nov 10.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Collaborative double robust targeted maximum likelihood estimation.协作双稳健靶向最大似然估计
Int J Biostat. 2010 May 17;6(1):Article 17. doi: 10.2202/1557-4679.1181.
7
Empirical Bayes Estimation and Prediction Using Summary-Level Information From External Big Data Sources Adjusting for Violations of Transportability.使用来自外部大数据源的汇总级信息进行经验贝叶斯估计和预测,并针对可移植性违规进行调整。
Stat Biosci. 2018 Dec;10(3):568-586. doi: 10.1007/s12561-018-9217-4. Epub 2018 May 14.
8
Part 2. Development of Enhanced Statistical Methods for Assessing Health Effects Associated with an Unknown Number of Major Sources of Multiple Air Pollutants.第2部分。开发增强的统计方法,以评估与多种空气污染物的未知数量主要来源相关的健康影响。
Res Rep Health Eff Inst. 2015 Jun(183 Pt 1-2):51-113.
9
Probability-enhanced sufficient dimension reduction for binary classification.用于二元分类的概率增强型充分降维
Biometrics. 2014 Sep;70(3):546-55. doi: 10.1111/biom.12174. Epub 2014 Apr 29.
10
Improving estimation efficiency for regression with MNAR covariates.提高具有 MAR 协变量的回归估计效率。
Biometrics. 2020 Mar;76(1):270-280. doi: 10.1111/biom.13131. Epub 2019 Nov 7.

本文引用的文献

1
Simultaneous selection and incorporation of consistent external aggregate information.同时选择和整合一致的外部聚合信息。
Stat Med. 2023 Dec 30;42(30):5630-5645. doi: 10.1002/sim.9929. Epub 2023 Oct 3.
2
Integrating Information from Existing Risk Prediction Models with No Model Details.整合来自现有风险预测模型的信息且无模型细节。
Can J Stat. 2023 Jun;51(2):355-374. doi: 10.1002/cjs.11701. Epub 2022 Apr 15.
3
Data integration: exploiting ratios of parameter estimates from a reduced external model.数据整合:利用简化外部模型中参数估计值的比率
Biometrika. 2022 Apr 12;110(1):119-134. doi: 10.1093/biomet/asac022. eCollection 2023 Mar.
4
Risk Projection for Time-to-event Outcome Leveraging Summary Statistics With Source Individual-level Data.利用汇总统计数据和源个体水平数据对事件发生时间结局进行风险预测。
J Am Stat Assoc. 2022;117(540):2043-2055. doi: 10.1080/01621459.2021.1895810. Epub 2021 Apr 22.
5
Semiparametric estimation of the transformation model by leveraging external aggregate data in the presence of population heterogeneity.利用群体异质性下外部聚合数据对半参数变换模型进行估计。
Biometrics. 2023 Sep;79(3):1996-2009. doi: 10.1111/biom.13778. Epub 2022 Nov 10.
6
Improving trial generalizability using observational studies.利用观察性研究提高试验的概括性。
Biometrics. 2023 Jun;79(2):1213-1225. doi: 10.1111/biom.13609. Epub 2022 Jan 11.
7
Improving main analysis by borrowing information from auxiliary data.通过借鉴辅助数据中的信息来改进主要分析。
Stat Med. 2022 Feb 10;41(3):567-579. doi: 10.1002/sim.9252. Epub 2021 Nov 18.
8
Elastic priors to dynamically borrow information from historical data in clinical trials.弹性先验以在临床试验中动态借鉴历史数据中的信息。
Biometrics. 2023 Mar;79(1):49-60. doi: 10.1111/biom.13551. Epub 2021 Sep 20.
9
The Clinician and Dataset Shift in Artificial Intelligence.临床医生与人工智能中的数据集偏移
N Engl J Med. 2021 Jul 15;385(3):283-286. doi: 10.1056/NEJMc2104626.
10
The ARIC (Atherosclerosis Risk In Communities) Study: JACC Focus Seminar 3/8.ARIC(社区动脉粥样硬化风险研究):JACC 重点研讨会 3/8。
J Am Coll Cardiol. 2021 Jun 15;77(23):2939-2959. doi: 10.1016/j.jacc.2021.04.035.