BAMITA: Bayesian multiple imputation for tensor arrays.

Authors

Jiang Ziren, Li Gen, Lock Eric F

Affiliations

Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, 2221 University Avenue SE, Minneapolis, MN 55414, United States.

Department of Biostatistics, School of Public Health, University of Michigan, 1415 Washington Heights, M4210, Ann Arbor, MI 48109, United States.

Publication information

Biostatistics. 2024 Dec 31;26(1). doi: 10.1093/biostatistics/kxae047.

DOI: 10.1093/biostatistics/kxae047
PMID: 39673775
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11823239/
Abstract

Data increasingly take the form of a multi-way array, or tensor, in several biomedical domains. Such tensors are often incompletely observed. For example, we are motivated by longitudinal microbiome studies in which several timepoints are missing for several subjects. There is a growing literature on missing data imputation for tensors. However, existing methods give a point estimate for missing values without capturing uncertainty. We propose a multiple imputation approach for tensors in a flexible Bayesian framework that yields realistic simulated values for missing entries and can propagate uncertainty through subsequent analyses. Our model uses efficient and widely applicable conjugate priors for a CANDECOMP/PARAFAC (CP) factorization, with a separable residual covariance structure. This approach is shown to perform well with respect to both imputation accuracy and uncertainty calibration, for scenarios in which either single entries or entire fibers of the tensor are missing. For two microbiome applications, it is shown to accurately capture uncertainty in the full microbiome profile at missing timepoints and is used to infer trends in species diversity for the population. Documented R code to perform our multiple imputation approach is available at https://github.com/lockEF/MultiwayImputation.
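
As a rough illustration of two ideas named in the abstract, the Python sketch below shows how a CP factorization reconstructs one tensor entry and how results from several imputed datasets can be pooled. It is not the authors' R implementation. It assumes a three-way tensor whose entry (i, j, k) is the sum over r of U[i][r] * V[j][r] * W[k][r], and it pools with Rubin's rules, which is one standard way to propagate imputation uncertainty and may differ from what the paper does. The names `cp_entry` and `pool_rubin` and all numbers are hypothetical.

```python
from statistics import mean, variance


def cp_entry(U, V, W, i, j, k):
    """Entry (i, j, k) of a rank-R CP tensor: sum_r U[i][r] * V[j][r] * W[k][r]."""
    return sum(u * v * w for u, v, w in zip(U[i], V[j], W[k]))


def pool_rubin(estimates, variances):
    """Pool M point estimates and their variances with Rubin's rules.

    Assumption: this pooling rule is illustrative, not taken from the paper.
    """
    m = len(estimates)
    q_bar = mean(estimates)        # pooled point estimate
    within = mean(variances)       # average within-imputation variance
    between = variance(estimates)  # sample variance across imputations
    total = within + (1 + 1 / m) * between
    return q_bar, total


# Hypothetical rank-2 factor matrices for a 2 x 2 x 2 tensor
U = [[1.0, 0.0], [0.5, 2.0]]
V = [[2.0, 1.0], [1.0, 1.0]]
W = [[1.0, 3.0], [0.0, 1.0]]
x_100 = cp_entry(U, V, W, 1, 0, 0)  # 0.5*2*1 + 2*1*3

# Hypothetical estimates of one quantity from M = 3 imputed datasets
pooled, total_var = pool_rubin([1.0, 2.0, 3.0], [0.5, 0.5, 0.5])
```

In a Bayesian multiple imputation workflow, each imputed dataset would come from a posterior draw of the missing entries rather than from fixed factors as here. The between-imputation term is what carries the uncertainty about the missing values into the final variance.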

