• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过边际竞争组装学习多元分布。

Learning multivariate distributions by competitive assembly of marginals.

机构信息

Department of Applied Mathematics and Statistics, Center for Imaging Science and Institute for Computational Medicine, Johns Hopkins University, Clark Hall, 3400 N. Charles St., Baltimore, MD 21218, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2013 Feb;35(2):398-410. doi: 10.1109/TPAMI.2012.96.

DOI:10.1109/TPAMI.2012.96
PMID:22529323
Abstract

We present a new framework for learning high-dimensional multivariate probability distributions from estimated marginals. The approach is motivated by compositional models and Bayesian networks, and designed to adapt to small sample sizes. We start with a large, overlapping set of elementary statistical building blocks, or "primitives," which are low-dimensional marginal distributions learned from data. Each variable may appear in many primitives. Subsets of primitives are combined in a Lego-like fashion to construct a probabilistic graphical model; only a small fraction of the primitives will participate in any valid construction. Since primitives can be precomputed, parameter estimation and structure search are separated. Model complexity is controlled by strong biases; we adapt the primitives to the amount of training data and impose rules which restrict the merging of them into allowable compositions. The likelihood of the data decomposes into a sum of local gains, one for each primitive in the final structure. We focus on a specific subclass of networks which are binary forests. Structure optimization corresponds to an integer linear program and the maximizing composition can be computed for reasonably large numbers of variables. Performance is evaluated using both synthetic data and real datasets from natural language processing and computational biology.

摘要

我们提出了一种从估计的边缘分布中学习高维多元概率分布的新框架。该方法的灵感来自组合模型和贝叶斯网络,旨在适应小样本量。我们从大量重叠的基本统计构建块或“基元”开始,这些基元是从数据中学习到的低维边缘分布。每个变量都可能出现在许多基元中。基元的子集以乐高式的方式组合在一起,构成一个概率图形模型;只有一小部分基元会参与任何有效的构建。由于基元可以预先计算,因此参数估计和结构搜索是分开的。模型复杂度由强偏差控制;我们根据训练数据的数量调整基元,并施加规则限制它们合并为允许的组合。数据的似然分解为局部增益的和,每个基元在最终结构中一个增益。我们专注于一种特定的网络子类,即二进制森林。结构优化对应于整数线性规划,并且可以为相当多的变量计算最大组合。使用来自自然语言处理和计算生物学的合成数据和真实数据集来评估性能。

相似文献

1
Learning multivariate distributions by competitive assembly of marginals.通过边际竞争组装学习多元分布。
IEEE Trans Pattern Anal Mach Intell. 2013 Feb;35(2):398-410. doi: 10.1109/TPAMI.2012.96.
2
Statistical instance-based pruning in ensembles of independent classifiers.独立分类器集成中的基于统计实例的剪枝
IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):364-9. doi: 10.1109/TPAMI.2008.204.
3
Geometric decision tree.几何决策树
IEEE Trans Syst Man Cybern B Cybern. 2012 Feb;42(1):181-92. doi: 10.1109/TSMCB.2011.2163392. Epub 2011 Sep 1.
4
A dynamic hybrid framework for constrained evolutionary optimization.一种用于约束进化优化的动态混合框架。
IEEE Trans Syst Man Cybern B Cybern. 2012 Feb;42(1):203-17. doi: 10.1109/TSMCB.2011.2161467. Epub 2011 Aug 4.
5
Learning graphical model parameters with approximate marginal inference.用近似边缘推理学习图形模型参数。
IEEE Trans Pattern Anal Mach Intell. 2013 Oct;35(10):2454-67. doi: 10.1109/TPAMI.2013.31.
6
Semisupervised learning of hidden Markov models via a homotopy method.通过同伦方法对隐马尔可夫模型进行半监督学习。
IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):275-87. doi: 10.1109/TPAMI.2008.71.
7
A self-learning particle swarm optimizer for global optimization problems.一种用于全局优化问题的自学习粒子群优化器。
IEEE Trans Syst Man Cybern B Cybern. 2012 Jun;42(3):627-46. doi: 10.1109/TSMCB.2011.2171946. Epub 2011 Nov 4.
8
Graph-based semisupervised learning.基于图的半监督学习。
IEEE Trans Pattern Anal Mach Intell. 2008 Jan;30(1):174-9. doi: 10.1109/TPAMI.2007.70765.
9
Fast rule identification and neighborhood selection for cellular automata.细胞自动机的快速规则识别与邻域选择
IEEE Trans Syst Man Cybern B Cybern. 2011 Jun;41(3):749-60. doi: 10.1109/TSMCB.2010.2091271. Epub 2010 Dec 3.
10
Tailored aggregation for classification.用于分类的定制聚合。
IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2098-105. doi: 10.1109/TPAMI.2009.55.

引用本文的文献

1
Decoding Immunodeficiencies with Artificial Intelligence: A New Era of Precision Medicine.利用人工智能解码免疫缺陷:精准医学的新时代。
Biomedicines. 2025 Jul 28;13(8):1836. doi: 10.3390/biomedicines13081836.
2
PI Prob: A risk prediction and clinical guidance system for evaluating patients with recurrent infections.PI 问题:用于评估复发性感染患者的风险预测和临床指导系统。
PLoS One. 2021 Feb 16;16(2):e0237285. doi: 10.1371/journal.pone.0237285. eCollection 2021.
3
A Multi-Method Approach for Proteomic Network Inference in 11 Human Cancers.
一种用于11种人类癌症蛋白质组网络推断的多方法途径。
PLoS Comput Biol. 2016 Feb 29;12(2):e1004765. doi: 10.1371/journal.pcbi.1004765. eCollection 2016 Feb.