• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对称性、规范自由度与序列-功能关系的可解释性。

Symmetry, gauge freedoms, and the interpretability of sequence-function relationships.

作者信息

Posfai Anna, McCandlish David M, Kinney Justin B

机构信息

Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory.

出版信息

Phys Rev Res. 2025 Apr-Jun;7(2). doi: 10.1103/physrevresearch.7.023005. Epub 2025 Apr 2.

DOI:10.1103/physrevresearch.7.023005
PMID:40837489
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12363380/
Abstract

Quantitative models that describe how biological sequences encode functional activities are ubiquitous in modern biology. One important aspect of these models is that they commonly exhibit gauge freedoms, i.e., directions in parameter space that do not affect model predictions. In physics, gauge freedoms arise when physical theories are formulated in ways that respect fundamental symmetries. However, the connections that gauge freedoms in models of sequence-function relationships have to the symmetries of sequence space have yet to be systematically studied. Here we study the gauge freedoms of models that respect a specific symmetry of sequence space: the group of position-specific character permutations. We find that gauge freedoms arise when model parameters transform under redundant irreducible matrix representations of this group. Based on this finding, we describe an "embedding distillation" procedure that enables analytic calculation of the number of independent gauge freedoms, as well as efficient computation of a sparse basis for the space of gauge freedoms. We also study how parameter transformation behavior affects parameter interpretability. We find that in many (and possibly all) nontrivial models, the ability to interpret individual model parameters as quantifying intrinsic allelic effects requires that gauge freedoms be present. This finding establishes an incompatibility between two distinct notions of parameter interpretability. Our work thus advances the understanding of symmetries, gauge freedoms, and parameter interpretability in sequence-function relationships.

摘要

描述生物序列如何编码功能活性的定量模型在现代生物学中无处不在。这些模型的一个重要方面是它们通常表现出规范自由度,即参数空间中不影响模型预测的方向。在物理学中,当物理理论以尊重基本对称性的方式表述时会出现规范自由度。然而,序列 - 功能关系模型中的规范自由度与序列空间对称性之间的联系尚未得到系统研究。在这里,我们研究尊重序列空间特定对称性的模型的规范自由度:位置特异性字符置换群。我们发现,当模型参数在该群的冗余不可约矩阵表示下变换时会出现规范自由度。基于这一发现,我们描述了一种“嵌入蒸馏”程序,该程序能够解析计算独立规范自由度的数量,并有效计算规范自由度空间的稀疏基。我们还研究了参数变换行为如何影响参数可解释性。我们发现,在许多(可能所有)非平凡模型中,将单个模型参数解释为量化内在等位基因效应的能力需要存在规范自由度。这一发现确立了两种不同的参数可解释性概念之间的不相容性。因此,我们的工作推进了对序列 - 功能关系中的对称性、规范自由度和参数可解释性的理解。

相似文献

1
Symmetry, gauge freedoms, and the interpretability of sequence-function relationships.对称性、规范自由度与序列-功能关系的可解释性。
Phys Rev Res. 2025 Apr-Jun;7(2). doi: 10.1103/physrevresearch.7.023005. Epub 2025 Apr 2.
2
Symmetry, gauge freedoms, and the interpretability of sequence-function relationships.对称性、规范自由度以及序列-功能关系的可解释性。
bioRxiv. 2025 Mar 17:2024.05.12.593774. doi: 10.1101/2024.05.12.593774.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.成年自闭症患者的就业生活经历:系统检索与综述
Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.
5
Sexual Harassment and Prevention Training性骚扰与预防培训
6
Gravity generated by four one-dimensional unitary gauge symmetries and the Standard Model.由四种一维幺正规范对称性和标准模型产生的引力。
Rep Prog Phys. 2025 May 2;88(5). doi: 10.1088/1361-6633/adc82e.
7
"In a State of Flow": A Qualitative Examination of Autistic Adults' Phenomenological Experiences of Task Immersion.“心流状态”:对自闭症成年人任务沉浸现象学体验的质性研究
Autism Adulthood. 2024 Sep 16;6(3):362-373. doi: 10.1089/aut.2023.0032. eCollection 2024 Sep.
8
"I Don't Understand Their Sense of Belonging": Exploring How Nonbinary Autistic Adults Experience Gender.“我不理解他们的归属感”:探索非二元性别的自闭症成年人如何体验性别。
Autism Adulthood. 2024 Dec 2;6(4):462-473. doi: 10.1089/aut.2023.0071. eCollection 2024 Dec.
9
Short-Term Memory Impairment短期记忆障碍
10
Gauge fixing for sequence-function relationships.序列-功能关系的规范固定。
bioRxiv. 2024 Jun 24:2024.05.12.593772. doi: 10.1101/2024.05.12.593772.

引用本文的文献

1
On learning functions over biological sequence space: relating Gaussian process priors, regularization, and gauge fixing.关于生物序列空间上的学习函数:关联高斯过程先验、正则化和规范固定。
bioRxiv. 2025 Jul 11:2025.04.26.650699. doi: 10.1101/2025.04.26.650699.
2
On learning functions over biological sequence space: relating Gaussian process priors, regularization, and gauge fixing.关于生物序列空间上的学习函数:关联高斯过程先验、正则化和规范固定。
ArXiv. 2025 Jul 11:arXiv:2504.19034v2.
3
Efficient epistasis inference via higher-order covariance matrix factorization.

本文引用的文献

1
Gauge fixing for sequence-function relationships.序列-功能关系的规范固定
PLoS Comput Biol. 2025 Mar 20;21(3):e1012818. doi: 10.1371/journal.pcbi.1012818. eCollection 2025.
2
Interpreting -regulatory mechanisms from genomic deep neural networks using surrogate models.使用替代模型从基因组深度神经网络解释调控机制。
Nat Mach Intell. 2024 Jun;6(6):701-713. doi: 10.1038/s42256-024-00851-5. Epub 2024 Jun 21.
3
Interpretable pairwise distillations for generative protein sequence models.可解释的成对蒸馏方法用于生成蛋白质序列模型。
通过高阶协方差矩阵分解进行高效上位性推断。
bioRxiv. 2024 Oct 14:2024.10.14.618287. doi: 10.1101/2024.10.14.618287.
4
Gauge fixing for sequence-function relationships.序列-功能关系的规范固定。
bioRxiv. 2024 Jun 24:2024.05.12.593772. doi: 10.1101/2024.05.12.593772.
PLoS Comput Biol. 2022 Jun 23;18(6):e1010219. doi: 10.1371/journal.pcbi.1010219. eCollection 2022 Jun.
4
Prediction of protein-ligand binding affinity from sequencing data with interpretable machine learning.基于可解释机器学习的测序数据预测蛋白-配体结合亲和力。
Nat Biotechnol. 2022 Oct;40(10):1520-1527. doi: 10.1038/s41587-022-01307-0. Epub 2022 May 23.
5
Correlations from structure and phylogeny combine constructively in the inference of protein partners from sequences.结构和系统发生的相关性在从序列推断蛋白质伴侣时具有建设性的结合。
PLoS Comput Biol. 2022 May 16;18(5):e1010147. doi: 10.1371/journal.pcbi.1010147. eCollection 2022 May.
6
MAVE-NN: learning genotype-phenotype maps from multiplex assays of variant effect.MAVE-NN:从变异效应的多重分析中学习基因型-表型图谱。
Genome Biol. 2022 Apr 15;23(1):98. doi: 10.1186/s13059-022-02661-7.
7
Learning protein fitness models from evolutionary and assay-labeled data.从进化和实验标记数据中学习蛋白质适应性模型。
Nat Biotechnol. 2022 Jul;40(7):1114-1122. doi: 10.1038/s41587-021-01146-5. Epub 2022 Jan 17.
8
On the sparsity of fitness functions and implications for learning.关于适应度函数的稀疏性及其对学习的影响。
Proc Natl Acad Sci U S A. 2022 Jan 4;119(1). doi: 10.1073/pnas.2109649118.
9
Massively Parallel Assays and Quantitative Sequence-Function Relationships.大规模平行分析与定量序列功能关系。
Annu Rev Genomics Hum Genet. 2019 Aug 31;20:99-127. doi: 10.1146/annurev-genom-083118-014845. Epub 2019 May 15.
10
Influence of multiple-sequence-alignment depth on Potts statistical models of protein covariation.多序列比对深度对蛋白质共变的 Potts 统计模型的影响。
Phys Rev E. 2019 Mar;99(3-1):032405. doi: 10.1103/PhysRevE.99.032405.