基于熵图的后验正则化

Entropic Graph-based Posterior Regularization.

作者信息

Libbrecht Maxwell W, Hoffman Michael M, Bilmes Jeffrey A, Noble William S

机构信息

Genome Sciences, Box 355065, Foege Building, S220B, 3720 15th Ave NE, Seattle, WA 98195-5065.

Princess Margaret Cancer Centre, Toronto Medical Discovery Tower 11-311, 101 College St, Toronto, ON M5G 1L7.

出版信息

Proc Int Conf Mach Learn. 2015 Jul;37:1992-2001.

PMID:39483441

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11526501/

Abstract

Graph smoothness objectives have achieved great success in semi-supervised learning but have not yet been applied extensively to unsupervised generative models. We define a new class of entropic graph-based posterior regularizers that augment a probabilistic model by encouraging pairs of nearby variables in a regularization graph to have similar posterior distributions. We present a three-way alternating optimization algorithm with closed-form updates for performing inference on this joint model and learning its parameters. This method admits updates linear in the degree of the regularization graph, exhibits monotone convergence, and is easily parallelizable. We are motivated by applications in computational biology in which temporal models such as hidden Markov models are used to learn a human-interpretable representation of genomic data. On a synthetic problem, we show that our method outperforms existing methods for graph-based regularization and a comparable strategy for incorporating long-range interactions using existing methods for approximate inference. Using genome-scale functional genomics data, we integrate genome 3D interaction data into existing models for genome annotation and demonstrate significant improvements in predicting genomic activity.

摘要

图平滑目标在半监督学习中取得了巨大成功，但尚未广泛应用于无监督生成模型。我们定义了一类新的基于熵的图后验正则化器，通过鼓励正则化图中相邻变量对具有相似的后验分布来增强概率模型。我们提出了一种具有闭式更新的三向交替优化算法，用于对这个联合模型进行推理并学习其参数。该方法允许在正则化图的度上进行线性更新，表现出单调收敛，并且易于并行化。我们的动机来自于计算生物学中的应用，其中诸如隐马尔可夫模型等时间模型用于学习基因组数据的人类可解释表示。在一个合成问题上，我们表明我们的方法优于现有的基于图的正则化方法以及使用现有近似推理方法纳入长程相互作用的可比策略。使用基因组规模的功能基因组学数据，我们将基因组三维相互作用数据整合到现有的基因组注释模型中，并证明在预测基因组活性方面有显著改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd17/11526501/2705a5a3aa7a/nihms-1902739-f0001.jpg

相似文献

Entropic Graph-based Posterior Regularization.基于熵图的后验正则化

Proc Int Conf Mach Learn. 2015 Jul;37:1992-2001.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

Short-Term Memory Impairment短期记忆障碍

Genetic determinants of testicular sperm extraction outcomes: insights from a large multicentre study of men with non-obstructive azoospermia.睾丸精子提取结果的遗传决定因素：来自一项针对非梗阻性无精子症男性的大型多中心研究的见解

Hum Reprod Open. 2025 Aug 29;2025(3):hoaf049. doi: 10.1093/hropen/hoaf049. eCollection 2025.

Plug-and-play use of tree-based methods: consequences for clinical prediction modeling.基于树的方法的即插即用：对临床预测模型的影响。

J Clin Epidemiol. 2025 Aug;184:111834. doi: 10.1016/j.jclinepi.2025.111834. Epub 2025 May 19.

Psychological interventions for adults who have sexually offended or are at risk of offending.针对有性犯罪行为或有性犯罪风险的成年人的心理干预措施。

Cochrane Database Syst Rev. 2012 Dec 12;12(12):CD007507. doi: 10.1002/14651858.CD007507.pub2.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Neuraminidase inhibitors for preventing and treating influenza in healthy adults and children.用于预防和治疗健康成人及儿童流感的神经氨酸酶抑制剂。

Cochrane Database Syst Rev. 2012 Jan 18;1:CD008965. doi: 10.1002/14651858.CD008965.pub3.

Anterior Approach Total Ankle Arthroplasty with Patient-Specific Cut Guides.使用患者特异性截骨导向器的前路全踝关节置换术。

JBJS Essent Surg Tech. 2025 Aug 15;15(3). doi: 10.2106/JBJS.ST.23.00027. eCollection 2025 Jul-Sep.

A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。

Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.

本文引用的文献

Three-dimensional modeling of the P. falciparum genome during the erythrocytic cycle reveals a strong connection between genome architecture and gene expression.疟原虫基因组在红细胞周期中的三维建模揭示了基因组结构与基因表达之间的紧密联系。

Genome Res. 2014 Jun;24(6):974-88. doi: 10.1101/gr.169417.113. Epub 2014 Mar 26.

Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts.统计置信度估计用于 Hi-C 数据揭示调控染色质接触。

Genome Res. 2014 Jun;24(6):999-1011. doi: 10.1101/gr.160374.113. Epub 2014 Feb 5.

Integrative annotation of chromatin elements from ENCODE data.整合 ENCODE 数据中的染色质元件注释

Nucleic Acids Res. 2013 Jan;41(2):827-41. doi: 10.1093/nar/gks1284. Epub 2012 Dec 5.

An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。

Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.

Topological domains in mammalian genomes identified by analysis of chromatin interactions.哺乳动物基因组中通过分析染色质相互作用而鉴定的拓扑结构域。

Nature. 2012 Apr 11;485(7398):376-80. doi: 10.1038/nature11082.

Unsupervised pattern discovery in human chromatin structure through genomic segmentation.通过基因组分割实现人类染色质结构的无监督模式发现。

Nat Methods. 2012 Mar 18;9(5):473-6. doi: 10.1038/nmeth.1937.

Systematic protein location mapping reveals five principal chromatin types in Drosophila cells.系统蛋白质定位图谱揭示了果蝇细胞中的五种主要染色质类型。

Cell. 2010 Oct 15;143(2):212-24. doi: 10.1016/j.cell.2010.09.009. Epub 2010 Sep 30.

Discovery and characterization of chromatin states for systematic annotation of the human genome.发现和描述染色质状态，用于系统注释人类基因组。

Nat Biotechnol. 2010 Aug;28(8):817-25. doi: 10.1038/nbt.1662. Epub 2010 Jul 25.

Evolutionarily conserved replication timing profiles predict long-range chromatin interactions and distinguish closely related cell types.进化保守的复制时间图谱可预测长距离染色质相互作用，并区分密切相关的细胞类型。

Genome Res. 2010 Jun;20(6):761-70. doi: 10.1101/gr.099655.109. Epub 2010 Apr 29.

Comprehensive mapping of long-range interactions reveals folding principles of the human genome.远距离相互作用的全面图谱揭示了人类基因组的折叠原理。

Science. 2009 Oct 9;326(5950):289-93. doi: 10.1126/science.1181369.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验