• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在存在群体异质性的情况下综合外部聚合信息:一种惩罚经验似然方法。

Synthesizing external aggregated information in the presence of population heterogeneity: A penalized empirical likelihood approach.

机构信息

Department of Epidemiology & Biostatistics, University of California at San Francisco, San Francisco, California.

Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, New York.

出版信息

Biometrics. 2022 Jun;78(2):679-690. doi: 10.1111/biom.13429. Epub 2021 Feb 11.

DOI:10.1111/biom.13429
PMID:33528028
Abstract

With the increasing availability of data in the public domain, there has been a growing interest in exploiting information from external sources to improve the analysis of smaller scale studies. An emerging challenge in the era of big data is that the subject-level data are high dimensional, but the external information is at an aggregate level and of a lower dimension. Moreover, heterogeneity and uncertainty in the auxiliary information are often not accounted for in information synthesis. In this paper, we propose a unified framework to summarize various forms of aggregated information via estimating equations and develop a penalized empirical likelihood approach to incorporate such information in logistic regression. When the homogeneity assumption is violated, we extend the method to account for population heterogeneity among different sources of information. When the uncertainty in the external information is not negligible, we propose a variance estimator adjusting for the uncertainty. The proposed estimators are asymptotically more efficient than the conventional penalized maximum likelihood estimator and enjoy the oracle property even with a diverging number of predictors. Simulation studies show that the proposed approaches yield higher accuracy in variable selection compared with competitors. We illustrate the proposed methodologies with a pediatric kidney transplant study.

摘要

随着公共领域数据的日益丰富,人们越来越感兴趣地利用来自外部资源的信息来改进较小规模研究的分析。在大数据时代,一个新兴的挑战是,主体级别的数据是高维的,但外部信息是聚合水平的,且维度较低。此外,辅助信息中的异质性和不确定性通常在信息综合中没有得到考虑。在本文中,我们提出了一个统一的框架,通过估计方程来总结各种形式的聚合信息,并开发了一种惩罚经验似然方法,将此类信息纳入逻辑回归中。当同质性假设被违反时,我们将方法扩展到考虑不同信息源之间的人群异质性。当外部信息的不确定性不可忽略时,我们提出了一种方差估计器来调整不确定性。所提出的估计量在渐近意义上比传统的惩罚最大似然估计量更有效,即使预测变量数量不断增加,也能享有 Oracle 性质。模拟研究表明,与竞争对手相比,所提出的方法在变量选择方面具有更高的准确性。我们用儿科肾移植研究来说明所提出的方法。

相似文献

1
Synthesizing external aggregated information in the presence of population heterogeneity: A penalized empirical likelihood approach.在存在群体异质性的情况下综合外部聚合信息:一种惩罚经验似然方法。
Biometrics. 2022 Jun;78(2):679-690. doi: 10.1111/biom.13429. Epub 2021 Feb 11.
2
Synthesizing external aggregated information in the penalized Cox regression under population heterogeneity.在人群异质性下的惩罚 Cox 回归中综合外部聚合信息。
Stat Med. 2021 Oct 15;40(23):4915-4930. doi: 10.1002/sim.9101. Epub 2021 Jun 16.
3
A unified approach for synthesizing population-level covariate effect information in semiparametric estimation with survival data.一种在生存数据的半参数估计中综合总体水平协变量效应信息的统一方法。
Stat Med. 2020 May 15;39(10):1573-1590. doi: 10.1002/sim.8499. Epub 2020 Feb 19.
4
Semiparametric estimation of the transformation model by leveraging external aggregate data in the presence of population heterogeneity.利用群体异质性下外部聚合数据对半参数变换模型进行估计。
Biometrics. 2023 Sep;79(3):1996-2009. doi: 10.1111/biom.13778. Epub 2022 Nov 10.
5
Simultaneous selection and incorporation of consistent external aggregate information.同时选择和整合一致的外部聚合信息。
Stat Med. 2023 Dec 30;42(30):5630-5645. doi: 10.1002/sim.9929. Epub 2023 Oct 3.
6
Efficient Estimation of the Cox Model With Auxiliary Subgroup Survival Information.利用辅助亚组生存信息对Cox模型进行有效估计。
J Am Stat Assoc. 2016;111(514):787-799. doi: 10.1080/01621459.2015.1044090. Epub 2016 Aug 18.
7
Semiparametric estimation of the nonmixture cure model with auxiliary survival information.半参数估计非混合治愈模型与辅助生存信息。
Biometrics. 2022 Jun;78(2):448-459. doi: 10.1111/biom.13450. Epub 2021 Mar 26.
8
Censored linear regression in the presence or absence of auxiliary survival information.存在或不存在辅助生存信息时的删失线性回归
Biometrics. 2020 Sep;76(3):734-745. doi: 10.1111/biom.13193. Epub 2019 Dec 16.
9
Penalized Empirical Likelihood for the Sparse Cox Regression Model.稀疏Cox回归模型的惩罚经验似然法
J Stat Plan Inference. 2019 Jul;201:71-85. doi: 10.1016/j.jspi.2018.12.001. Epub 2018 Dec 15.
10
Penalized Bregman divergence for large-dimensional regression and classification.用于高维回归与分类的惩罚布雷格曼散度
Biometrika. 2010 Sep;97(3):551-566. doi: 10.1093/biomet/asq033. Epub 2010 Jun 30.

引用本文的文献

1
Likelihood adaptively incorporated external aggregate information with uncertainty for survival data.对生存数据的不确定性进行适应性整合外部聚集信息。
Biometrics. 2024 Oct 3;80(4). doi: 10.1093/biomtc/ujae120.
2
Integrating external summary information in the presence of prior probability shift: an application to assessing essential hypertension.在存在先验概率转移的情况下整合外部摘要信息:在评估原发性高血压中的应用。
Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae090.
3
Improving prediction of linear regression models by integrating external information from heterogeneous populations: James-Stein estimators.
通过整合来自异质群体的外部信息来改进线性回归模型的预测:詹姆斯-斯廷(James-Stein)估计量。
Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae072.
4
Efficient estimation of a Cox model when integrating the subgroup incidence rate information.整合亚组发病率信息时Cox模型的有效估计。
J Appl Stat. 2022 May 4;50(10):2151-2170. doi: 10.1080/02664763.2022.2068512. eCollection 2023.
5
Integrating Information from Existing Risk Prediction Models with No Model Details.整合来自现有风险预测模型的信息且无模型细节。
Can J Stat. 2023 Jun;51(2):355-374. doi: 10.1002/cjs.11701. Epub 2022 Apr 15.
6
Data integration: exploiting ratios of parameter estimates from a reduced external model.数据整合:利用简化外部模型中参数估计值的比率
Biometrika. 2022 Apr 12;110(1):119-134. doi: 10.1093/biomet/asac022. eCollection 2023 Mar.