• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将样本相似度纳入潜在类别分析:一种树状收缩方法。

Integrating sample similarities into latent class analysis: a tree-structured shrinkage approach.

机构信息

Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Environmental and Occupational Health, Milken Institute School of Public Health, The George Washington University, Washington, District of Columbia, USA.

出版信息

Biometrics. 2023 Mar;79(1):264-279. doi: 10.1111/biom.13580. Epub 2021 Nov 10.

DOI:10.1111/biom.13580
PMID:34658017
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10642217/
Abstract

This paper is concerned with using multivariate binary observations to estimate the probabilities of unobserved classes with scientific meanings. We focus on the setting where additional information about sample similarities is available and represented by a rooted weighted tree. Every leaf in the given tree contains multiple samples. Shorter distances over the tree between the leaves indicate a priori higher similarity in class probability vectors. We propose a novel data integrative extension to classical latent class models with tree-structured shrinkage. The proposed approach enables (1) borrowing of information across leaves, (2) estimating data-driven leaf groups with distinct vectors of class probabilities, and (3) individual-level probabilistic class assignment given the observed multivariate binary measurements. We derive and implement a scalable posterior inference algorithm in a variational Bayes framework. Extensive simulations show more accurate estimation of class probabilities than alternatives that suboptimally use the additional sample similarity information. A zoonotic infectious disease application is used to illustrate the proposed approach. The paper concludes by a brief discussion on model limitations and extensions.

摘要

本文旨在利用多元二分类观测值来估计具有科学意义的未观测类别的概率。我们关注的是存在关于样本相似性的附加信息且其由有根加权树表示的设定。给定树中的每个叶子包含多个样本。叶子之间的树距离越短,则先验类概率向量的相似度越高。我们提出了一种新颖的数据集成方法,将其应用于具有树结构收缩的经典潜在类别模型中。所提出的方法能够(1)在叶子之间进行信息借用,(2)估计具有不同类概率向量的数据驱动叶子组,以及(3)基于观察到的多元二分类测量值进行个体级别的概率分类分配。我们在变分贝叶斯框架中推导出并实现了一个可扩展的后验推断算法。广泛的模拟结果表明,该方法比那些次优地利用附加样本相似性信息的替代方法能够更准确地估计类概率。一个人畜共患传染病应用案例用于说明所提出的方法。最后,文章简要讨论了模型的局限性和扩展。

相似文献

1
Integrating sample similarities into latent class analysis: a tree-structured shrinkage approach.将样本相似度纳入潜在类别分析:一种树状收缩方法。
Biometrics. 2023 Mar;79(1):264-279. doi: 10.1111/biom.13580. Epub 2021 Nov 10.
2
Tree-informed Bayesian multi-source domain adaptation: cross-population probabilistic cause-of-death assignment using verbal autopsy.基于树的贝叶斯多源域自适应:使用死因推断调查进行跨人群概率死因分配。
Biostatistics. 2024 Oct 1;25(4):1233-1253. doi: 10.1093/biostatistics/kxae005.
3
Group-representative functional network estimation from multi-subject fMRI data via MRF-based image segmentation.基于马尔可夫随机场图像分割的多体素 fMRI 数据的群组代表性功能网络估计。
Comput Methods Programs Biomed. 2019 Oct;179:104976. doi: 10.1016/j.cmpb.2019.07.004. Epub 2019 Jul 19.
4
Probabilistic cause-of-disease assignment using case-control diagnostic tests: A latent variable regression approach.使用病例对照诊断测试进行疾病病因的概率分配:一种潜在变量回归方法。
Stat Med. 2021 Feb 20;40(4):823-841. doi: 10.1002/sim.8804. Epub 2020 Nov 6.
5
Fair-balance paradox, star-tree paradox, and Bayesian phylogenetics.公平平衡悖论、星树悖论与贝叶斯系统发育学
Mol Biol Evol. 2007 Aug;24(8):1639-55. doi: 10.1093/molbev/msm081. Epub 2007 May 7.
6
A Bayesian approach to restricted latent class models for scientifically structured clustering of multivariate binary outcomes.一种贝叶斯方法,用于对多元二分类结局进行科学结构聚类的约束潜类模型。
Biometrics. 2021 Dec;77(4):1431-1444. doi: 10.1111/biom.13388. Epub 2020 Oct 28.
7
Latent-space variational bayes.潜在空间变分贝叶斯
IEEE Trans Pattern Anal Mach Intell. 2008 Dec;30(12):2236-42. doi: 10.1109/TPAMI.2008.157.
8
Bayesian Inference for an Unknown Number of Attributes in Restricted Latent Class Models.受限潜在类别模型中未知属性数量的贝叶斯推断
Psychometrika. 2023 Jun;88(2):613-635. doi: 10.1007/s11336-022-09900-7. Epub 2023 Jan 22.
9
Variational Supertrees for Bayesian Phylogenetics.变分超级树在贝叶斯系统发生学中的应用。
Bull Math Biol. 2024 Aug 5;86(9):114. doi: 10.1007/s11538-024-01338-5.
10
Greedy learning of binary latent trees.贪婪学习二进制潜在树。
IEEE Trans Pattern Anal Mach Intell. 2011 Jun;33(6):1087-97. doi: 10.1109/TPAMI.2010.145.

引用本文的文献

1
Pediatric urinary tract infections caused by poultry-associated .由家禽相关.引起的小儿尿路感染
Microbiol Spectr. 2024 Jul 2;12(7):e0341523. doi: 10.1128/spectrum.03415-23. Epub 2024 Jun 12.
2
Bayesian estimation methods for survey data with potential applications to health disparities research.用于调查数据的贝叶斯估计方法及其在健康差异研究中的潜在应用。
Wiley Interdiscip Rev Comput Stat. 2024 Jan-Feb;16(1). doi: 10.1002/wics.1633. Epub 2023 Aug 28.
3
Tree-informed Bayesian multi-source domain adaptation: cross-population probabilistic cause-of-death assignment using verbal autopsy.

本文引用的文献

1
Revisiting a Key Innovation in Evolutionary Biology: Felsenstein's "Phylogenies and the Comparative Method".重温进化生物学的一个关键创新:费雪斯坦的“系统发育与比较方法”。
Am Nat. 2019 Jun;193(6):755-772. doi: 10.1086/703055. Epub 2019 Apr 23.
2
Escherichia coli ST131-22 as a Foodborne Uropathogen.大肠杆菌 ST131-22 作为食源性尿路病原体。
mBio. 2018 Aug 28;9(4):e00470-18. doi: 10.1128/mBio.00470-18.
3
TENSOR DECOMPOSITIONS AND SPARSE LOG-LINEAR MODELS.张量分解与稀疏对数线性模型
基于树的贝叶斯多源域自适应:使用死因推断调查进行跨人群概率死因分配。
Biostatistics. 2024 Oct 1;25(4):1233-1253. doi: 10.1093/biostatistics/kxae005.
4
Using source-associated mobile genetic elements to identify zoonotic extraintestinal infections.利用与宿主相关的移动遗传元件来识别动物源性肠外感染。
One Health. 2023 Feb 28;16:100518. doi: 10.1016/j.onehlt.2023.100518. eCollection 2023 Jun.
Ann Stat. 2017;45(1):1-38. doi: 10.1214/15-AOS1414. Epub 2017 Feb 21.
4
Colonizing opportunistic pathogens (COPs): The beasts in all of us.定植性机会致病菌(COPs):我们所有人身体里的“野兽” 。
PLoS Pathog. 2017 Aug 10;13(8):e1006369. doi: 10.1371/journal.ppat.1006369. eCollection 2017 Aug.
5
Logistic Stick-Breaking Process.逻辑折断过程
J Mach Learn Res. 2011 Jan;12(Jan):203-239.
6
Social meets molecular: Combining phylogenetic and latent class analyses to understand HIV-1 transmission in Switzerland.社会与分子相遇:结合系统进化和潜在类别分析了解瑞士的 HIV-1 传播。
Am J Epidemiol. 2014 Jun 15;179(12):1514-25. doi: 10.1093/aje/kwu076. Epub 2014 May 12.
7
Nonparametric Bayesian models through probit stick-breaking processes.通过概率单位折断过程的非参数贝叶斯模型。
Bayesian Anal. 2011 Mar 1;6(1). doi: 10.1214/11-BA605.
8
Nonparametric Bayes Modeling of Multivariate Categorical Data.多变量分类数据的非参数贝叶斯建模
J Am Stat Assoc. 2012 Jan 1;104(487):1042-1051. doi: 10.1198/jasa.2009.tm08439.
9
Insights into latent class analysis of diagnostic test performance.诊断试验性能的潜在类别分析见解。
Biostatistics. 2007 Apr;8(2):474-84. doi: 10.1093/biostatistics/kxl038. Epub 2006 Nov 3.
10
Contamination of retail foods, particularly turkey, from community markets (Minnesota, 1999-2000) with antimicrobial-resistant and extraintestinal pathogenic Escherichia coli.社区市场(明尼苏达州,1999 - 2000年)零售食品,尤其是火鸡,受到耐抗菌药物和肠外致病性大肠杆菌的污染。
Foodborne Pathog Dis. 2005 Spring;2(1):38-49. doi: 10.1089/fpd.2005.2.38.