• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

协方差矩阵的收缩估计量。

Shrinkage estimators for covariance matrices.

作者信息

Daniels M J, Kass R E

机构信息

Department of Statistics, Iowa State University, Ames 50011, USA.

出版信息

Biometrics. 2001 Dec;57(4):1173-84. doi: 10.1111/j.0006-341x.2001.01173.x.

DOI:10.1111/j.0006-341x.2001.01173.x
PMID:11764258
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2748251/
Abstract

Estimation of covariance matrices in small samples has been studied by many authors. Standard estimators, like the unstructured maximum likelihood estimator (ML) or restricted maximum likelihood (REML) estimator, can be very unstable with the smallest estimated eigenvalues being too small and the largest too big. A standard approach to more stably estimating the matrix in small samples is to compute the ML or REML estimator under some simple structure that involves estimation of fewer parameters, such as compound symmetry or independence. However, these estimators will not be consistent unless the hypothesized structure is correct. If interest focuses on estimation of regression coefficients with correlated (or longitudinal) data, a sandwich estimator of the covariance matrix may be used to provide standard errors for the estimated coefficients that are robust in the sense that they remain consistent under misspecification of the covariance structure. With large matrices, however, the inefficiency of the sandwich estimator becomes worrisome. We consider here two general shrinkage approaches to estimating the covariance matrix and regression coefficients. The first involves shrinking the eigenvalues of the unstructured ML or REML estimator. The second involves shrinking an unstructured estimator toward a structured estimator. For both cases, the data determine the amount of shrinkage. These estimators are consistent and give consistent and asymptotically efficient estimates for regression coefficients. Simulations show the improved operating characteristics of the shrinkage estimators of the covariance matrix and the regression coefficients in finite samples. The final estimator chosen includes a combination of both shrinkage approaches, i.e., shrinking the eigenvalues and then shrinking toward structure. We illustrate our approach on a sleep EEG study that requires estimation of a 24 x 24 covariance matrix and for which inferences on mean parameters critically depend on the covariance estimator chosen. We recommend making inference using a particular shrinkage estimator that provides a reasonable compromise between structured and unstructured estimators.

摘要

许多作者研究了小样本协方差矩阵的估计。标准估计器,如无结构最大似然估计器(ML)或限制最大似然(REML)估计器,可能非常不稳定,估计出的最小特征值过小,最大特征值过大。在小样本中更稳定地估计矩阵的一种标准方法是在某种简单结构下计算ML或REML估计器,这种结构涉及较少参数的估计,如复合对称或独立性。然而,除非假设的结构正确,这些估计器将不一致。如果关注的是具有相关(或纵向)数据的回归系数估计,可以使用协方差矩阵的三明治估计器来为估计系数提供标准误差,这些标准误差在协方差结构指定错误的情况下仍保持一致,具有稳健性。然而,对于大型矩阵,三明治估计器的低效率令人担忧。我们在此考虑两种估计协方差矩阵和回归系数的一般收缩方法。第一种方法是收缩无结构ML或REML估计器的特征值。第二种方法是将无结构估计器向结构化估计器收缩。对于这两种情况,数据决定收缩量。这些估计器是一致的,并且对回归系数给出一致且渐近有效的估计。模拟显示了有限样本中协方差矩阵和回归系数收缩估计器改进的操作特性。最终选择的估计器包括两种收缩方法的组合,即先收缩特征值,然后向结构收缩。我们在一项睡眠脑电图研究中说明了我们的方法,该研究需要估计一个24×24的协方差矩阵,并且对均值参数的推断严重依赖于所选择的协方差估计器。我们建议使用一种特定的收缩估计器进行推断,该估计器在结构化和非结构化估计器之间提供了合理的折衷。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/731d/2748251/3a13f53c1c3f/nihms-143236-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/731d/2748251/3a13f53c1c3f/nihms-143236-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/731d/2748251/3a13f53c1c3f/nihms-143236-f0001.jpg

相似文献

1
Shrinkage estimators for covariance matrices.协方差矩阵的收缩估计量。
Biometrics. 2001 Dec;57(4):1173-84. doi: 10.1111/j.0006-341x.2001.01173.x.
2
Collaborative double robust targeted maximum likelihood estimation.协作双稳健靶向最大似然估计
Int J Biostat. 2010 May 17;6(1):Article 17. doi: 10.2202/1557-4679.1181.
3
Comparison of bias-corrected covariance estimators for MMRM analysis in longitudinal data with dropouts.纵向数据中含缺失值的重复测量混合模型分析的偏差校正协方差估计量比较
Stat Methods Med Res. 2017 Oct;26(5):2389-2406. doi: 10.1177/0962280215597938. Epub 2015 Aug 11.
4
Closed-form approximations to the REML estimator of a variance ratio (or heritability) in a mixed linear model.混合线性模型中方差比(或遗传力)的REML估计量的闭式近似值。
Biometrics. 2001 Dec;57(4):1148-56. doi: 10.1111/j.0006-341x.2001.01148.x.
5
Shrinkage approach for EEG covariance matrix estimation.脑电图协方差矩阵估计的收缩方法。
Annu Int Conf IEEE Eng Med Biol Soc. 2010;2010:1654-7. doi: 10.1109/IEMBS.2010.5626668.
6
Longitudinal regression of covariance matrix outcomes.协方差矩阵结果的纵向回归
Biostatistics. 2024 Apr 15;25(2):385-401. doi: 10.1093/biostatistics/kxac045.
7
An Empirical Bayes Approach to Shrinkage Estimation on the Manifold of Symmetric Positive-Definite Matrices.一种基于经验贝叶斯方法的对称正定矩阵流形上的收缩估计
J Am Stat Assoc. 2024;119(545):259-272. doi: 10.1080/01621459.2022.2110877. Epub 2022 Sep 27.
8
Principal regression for high dimensional covariance matrices.高维协方差矩阵的主回归
Electron J Stat. 2021;15(2):4192-4235. doi: 10.1214/21-ejs1887. Epub 2021 Sep 14.
9
Sampling distributions, biases, variances, and confidence intervals for genetic correlations.遗传相关性的抽样分布、偏差、方差和置信区间。
Theor Appl Genet. 1997 Jan;94(1):8-19. doi: 10.1007/s001220050375.
10
Double Robust Efficient Estimators of Longitudinal Treatment Effects: Comparative Performance in Simulations and a Case Study.纵向治疗效果的双重稳健有效估计量:模拟中的比较性能及一个案例研究
Int J Biostat. 2019 Feb 26;15(2):/j/ijb.2019.15.issue-2/ijb-2017-0054/ijb-2017-0054.xml. doi: 10.1515/ijb-2017-0054.

引用本文的文献

1
Disentangling signal and noise in neural responses through generative modeling.通过生成模型解析神经反应中的信号与噪声
PLoS Comput Biol. 2025 Jul 21;21(7):e1012092. doi: 10.1371/journal.pcbi.1012092. eCollection 2025 Jul.
2
A New Paradigm for High-dimensional Data: Distance-Based Semiparametric Feature Aggregation Framework via Between-Subject Attributes.高维数据的一种新范式:基于对象间属性的基于距离的半参数特征聚合框架
Scand Stat Theory Appl. 2024 Jun;51(2):672-696. doi: 10.1111/sjos.12695. Epub 2023 Nov 8.
3
Disentangling signal and noise in neural responses through generative modeling.

本文引用的文献

1
The effects of age and gender on sleep EEG power spectral density in the middle years of life (ages 20-60 years old).年龄和性别对中年(20至60岁)睡眠脑电图功率谱密度的影响。
Psychophysiology. 2001 Mar;38(2):232-42.
2
Meta-analysis in clinical trials.临床试验中的荟萃分析。
Control Clin Trials. 1986 Sep;7(3):177-88. doi: 10.1016/0197-2456(86)90046-2.
通过生成模型解析神经反应中的信号与噪声
bioRxiv. 2024 Aug 22:2024.04.22.590510. doi: 10.1101/2024.04.22.590510.
4
An Empirical Bayes Approach to Shrinkage Estimation on the Manifold of Symmetric Positive-Definite Matrices.一种基于经验贝叶斯方法的对称正定矩阵流形上的收缩估计
J Am Stat Assoc. 2024;119(545):259-272. doi: 10.1080/01621459.2022.2110877. Epub 2022 Sep 27.
5
Tuning parameters for polygenic risk score methods using GWAS summary statistics from training data.使用来自训练数据的 GWAS 汇总统计信息调整多基因风险评分方法的参数。
Nat Commun. 2024 Jan 2;15(1):24. doi: 10.1038/s41467-023-44009-0.
6
Flexible Signal Denoising via Flexible Empirical Bayes Shrinkage.通过灵活经验贝叶斯收缩实现灵活信号去噪
J Mach Learn Res. 2021 Jan-Dec;22.
7
Principal regression for high dimensional covariance matrices.高维协方差矩阵的主回归
Electron J Stat. 2021;15(2):4192-4235. doi: 10.1214/21-ejs1887. Epub 2021 Sep 14.
8
Joint modelling of longitudinal response and time-to-event data using conditional distributions: a Bayesian perspective.基于条件分布的纵向响应与事件发生时间数据的联合建模:贝叶斯视角
J Appl Stat. 2021 Mar 9;49(9):2228-2245. doi: 10.1080/02664763.2021.1897971. eCollection 2022.
9
Covariance shrinkage can assess and improve functional connectomes.协方差收缩可评估和改善功能连接组。
Neuroimage. 2022 Aug 1;256:119229. doi: 10.1016/j.neuroimage.2022.119229. Epub 2022 Apr 20.
10
Flexible Bayesian Dynamic Modeling of Correlation and Covariance Matrices.相关矩阵和协方差矩阵的灵活贝叶斯动态建模
Bayesian Anal. 2020 Dec;15(4):1199-1228. doi: 10.1214/19-ba1173. Epub 2019 Nov 4.