对称性、规范自由度以及序列-功能关系的可解释性。

Symmetry, gauge freedoms, and the interpretability of sequence-function relationships.

作者信息

Posfai Anna, McCandlish David M, Kinney Justin B

机构信息

Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory.

出版信息

bioRxiv. 2025 Mar 17:2024.05.12.593774. doi: 10.1101/2024.05.12.593774.

DOI:10.1101/2024.05.12.593774

PMID:38798625

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11118426/

Abstract

Quantitative models that describe how biological sequences encode functional activities are ubiquitous in modern biology. One important aspect of these models is that they commonly exhibit gauge freedoms, i.e., directions in parameter space that do not affect model predictions. In physics, gauge freedoms arise when physical theories are formulated in ways that respect fundamental symmetries. However, the connections that gauge freedoms in models of sequence-function relationships have to the symmetries of sequence space have yet to be systematically studied. In this work we study the gauge freedoms of models that respect a specific symmetry of sequence space: the group of position-specific character permutations. We find that gauge freedoms arise when model parameters transform under redundant irreducible matrix representations of this group. Based on this finding, we describe an "embedding distillation" procedure that enables both analytic calculation of the number of independent gauge freedoms and efficient computation of a sparse basis for the space of gauge freedoms. We also study how parameter transformation behavior affects parameter interpretability. We find that in many (and possibly all) nontrivial models, the ability to interpret individual model parameters as quantifying intrinsic allelic effects requires that gauge freedoms be present. This finding establishes an incompatibility between two distinct notions of parameter interpretability. Our work thus advances the understanding of symmetries, gauge freedoms, and parameter interpretability in models of sequence-function relationships.

摘要

描述生物序列如何编码功能活性的定量模型在现代生物学中无处不在。这些模型的一个重要方面是它们通常表现出规范自由度，即在参数空间中不影响模型预测的方向。在物理学中，当物理理论以尊重基本对称性的方式表述时，就会出现规范自由度。然而，序列 - 功能关系模型中的规范自由度与序列空间对称性之间的联系尚未得到系统研究。在这项工作中，我们研究了尊重序列空间特定对称性的模型的规范自由度：位置特异性字符置换群。我们发现，当模型参数在该群的冗余不可约矩阵表示下变换时，就会出现规范自由度。基于这一发现，我们描述了一种“嵌入蒸馏”程序，该程序既能对独立规范自由度的数量进行解析计算，又能高效计算规范自由度空间的稀疏基。我们还研究了参数变换行为如何影响参数的可解释性。我们发现，在许多（可能是所有）非平凡模型中，将单个模型参数解释为量化内在等位基因效应的能力要求存在规范自由度。这一发现确立了两种不同的参数可解释性概念之间的不相容性。因此，我们的工作推进了对序列 - 功能关系模型中的对称性、规范自由度和参数可解释性的理解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a84d/11956585/8770953c8332/nihpp-2024.05.12.593774v3-f0001.jpg

相似文献

Symmetry, gauge freedoms, and the interpretability of sequence-function relationships.

bioRxiv. 2025 Mar 17:2024.05.12.593774. doi: 10.1101/2024.05.12.593774.

Symmetry, gauge freedoms, and the interpretability of sequence-function relationships.

Phys Rev Res. 2025 Apr-Jun;7(2). doi: 10.1103/physrevresearch.7.023005. Epub 2025 Apr 2.

Prescription of Controlled Substances: Benefits and Risks

The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.

Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.

Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.

APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.

Sexual Harassment and Prevention Training

Gravity generated by four one-dimensional unitary gauge symmetries and the Standard Model.

Rep Prog Phys. 2025 May 2;88(5). doi: 10.1088/1361-6633/adc82e.

Short-Term Memory Impairment

"I Don't Understand Their Sense of Belonging": Exploring How Nonbinary Autistic Adults Experience Gender.

Autism Adulthood. 2024 Dec 2;6(4):462-473. doi: 10.1089/aut.2023.0071. eCollection 2024 Dec.

"In a State of Flow": A Qualitative Examination of Autistic Adults' Phenomenological Experiences of Task Immersion.

Autism Adulthood. 2024 Sep 16;6(3):362-373. doi: 10.1089/aut.2023.0032. eCollection 2024 Sep.

引用本文的文献

Gauge fixing for sequence-function relationships.

PLoS Comput Biol. 2025 Mar 20;21(3):e1012818. doi: 10.1371/journal.pcbi.1012818. eCollection 2025.

本文引用的文献

Gauge fixing for sequence-function relationships.

PLoS Comput Biol. 2025 Mar 20;21(3):e1012818. doi: 10.1371/journal.pcbi.1012818. eCollection 2025.

Interpreting -regulatory mechanisms from genomic deep neural networks using surrogate models.

Nat Mach Intell. 2024 Jun;6(6):701-713. doi: 10.1038/s42256-024-00851-5. Epub 2024 Jun 21.

Interpretable pairwise distillations for generative protein sequence models.

PLoS Comput Biol. 2022 Jun 23;18(6):e1010219. doi: 10.1371/journal.pcbi.1010219. eCollection 2022 Jun.

Prediction of protein-ligand binding affinity from sequencing data with interpretable machine learning.

Nat Biotechnol. 2022 Oct;40(10):1520-1527. doi: 10.1038/s41587-022-01307-0. Epub 2022 May 23.

Correlations from structure and phylogeny combine constructively in the inference of protein partners from sequences.

PLoS Comput Biol. 2022 May 16;18(5):e1010147. doi: 10.1371/journal.pcbi.1010147. eCollection 2022 May.

MAVE-NN: learning genotype-phenotype maps from multiplex assays of variant effect.

Genome Biol. 2022 Apr 15;23(1):98. doi: 10.1186/s13059-022-02661-7.

Learning protein fitness models from evolutionary and assay-labeled data.

Nat Biotechnol. 2022 Jul;40(7):1114-1122. doi: 10.1038/s41587-021-01146-5. Epub 2022 Jan 17.

On the sparsity of fitness functions and implications for learning.

Proc Natl Acad Sci U S A. 2022 Jan 4;119(1). doi: 10.1073/pnas.2109649118.

Massively Parallel Assays and Quantitative Sequence-Function Relationships.

Annu Rev Genomics Hum Genet. 2019 Aug 31;20:99-127. doi: 10.1146/annurev-genom-083118-014845. Epub 2019 May 15.

Influence of multiple-sequence-alignment depth on Potts statistical models of protein covariation.

Phys Rev E. 2019 Mar;99(3-1):032405. doi: 10.1103/PhysRevE.99.032405.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

对称性、规范自由度以及序列-功能关系的可解释性。

Symmetry, gauge freedoms, and the interpretability of sequence-function relationships.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献