• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于蛋白质遗传结构的分析:对《蛋白质序列格局并非如此简单》的回应

On the Analysis of Protein Genetic Architecture: Response to "Protein sequence landscapes are not so simple".

作者信息

Park Yeonwoo, Metzger Brian P H, Thornton Joseph W

机构信息

Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, IL, USA.

Current affiliation: Center for RNA Research, Institute for Basic Science, Seoul, Republic of Korea.

出版信息

bioRxiv. 2024 Dec 6:2024.09.17.613512. doi: 10.1101/2024.09.17.613512.

DOI:10.1101/2024.09.17.613512
PMID:39677708
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11642768/
Abstract

We recently reanalyzed 20 combinatorial mutagenesis datasets using a novel reference-free analysis (RFA) method and showed that high-order epistasis contributes negligibly to protein sequence-function relationships in every case. Dupic, Phillips, and Desai (DPD) commented on a preprint of our work. In our published paper, we addressed all the major issues they raised, but we respond directly to them here. 1) DPD's claim that RFA is equivalent to estimating reference-based analysis (RBA) models by regression neglects fundamental differences in how the two formalisms dissect the causal relationship between sequence and function. It also misinterprets the observation that using regression to estimate any truncated model of genetic architecture will always yield the same predicted phenotypes and variance partition; the resulting estimates correspond to those of the RFA formalism but are inaccurate representations of the true RBA model. 2) DPD's claim that high-order epistasis is widespread and significant while somehow explaining little phenotypic variance is an artifact of two strong biases in the use of regression to estimate RBA models: this procedure underestimates the phenotypic variance explained by RBA epistatic terms while at the same time inflating the magnitude of individual terms. 3) DPD erroneously claim that RFA is "exactly equivalent" to Fourier analysis (FA) and background-averaged analysis (BA). This error arises because DPD used an incorrect mathematical definition of RFA and were misled by a simple numerical relationship among the models that only holds only for the simplest kinds of datasets. 4) DPD argue that using a nonlinear transformation to account for global nonlinearities in sequence-function relationships is often unnecessary and may artifactually absorb specific epistatic interactions. We show that nonspecific epistasis caused by a limited dynamic range affects datasets of all types, even when the phenotype is represented on a free-energy scale. Moreover, using a nonlinear transformation in a joint fitting procedure does not underestimate specific epistasis under realistic conditions, even if the data are not affected by nonspecific epistasis. The conclusions of our work therefore hold: the genetic architecture of all 20 protein datasets we analyzed can be efficiently and accurately described in an RFA framework by first-order amino acid effects and pairwise interactions with a simple model of global nonlinearity. We are grateful for DPD's commentary, which helped us improve our paper.

摘要

我们最近使用一种新颖的无参考分析(RFA)方法重新分析了20个组合诱变数据集,并表明在每种情况下高阶上位性对蛋白质序列-功能关系的贡献可忽略不计。杜皮克、菲利普斯和德赛(DPD)对我们工作的一篇预印本发表了评论。在我们已发表的论文中,我们解决了他们提出的所有主要问题,但在此直接回应他们。1)DPD声称RFA等同于通过回归估计基于参考的分析(RBA)模型,这忽略了两种形式体系剖析序列与功能之间因果关系方式的根本差异。这也误解了这样一种观察结果,即使用回归来估计遗传结构的任何截断模型总是会产生相同的预测表型和方差划分;所得估计对应于RFA形式体系的估计,但并非真实RBA模型的准确表示。2)DPD声称高阶上位性广泛且显著,但在某种程度上解释的表型方差却很少,这是在使用回归估计RBA模型时两种强烈偏差造成的假象:此过程低估了RBA上位性项所解释的表型方差,同时又夸大了单个项的大小。3)DPD错误地声称RFA与傅里叶分析(FA)和背景平均分析(BA)“完全等效”。这个错误的产生是因为DPD使用了RFA的错误数学定义,并被模型之间仅对最简单类型的数据集成立的简单数值关系误导。4)DPD认为使用非线性变换来考虑序列-功能关系中的全局非线性通常是不必要的,并且可能会人为地吸收特定的上位性相互作用。我们表明,即使表型以自由能尺度表示,由有限动态范围引起的非特异性上位性也会影响所有类型的数据集。此外,在联合拟合过程中使用非线性变换在现实条件下不会低估特定上位性,即使数据不受非特异性上位性影响。因此,我们工作的结论仍然成立:通过一阶氨基酸效应以及与简单全局非线性模型的成对相互作用,我们分析的所有20个蛋白质数据集的遗传结构都可以在RFA框架中得到有效且准确的描述。我们感谢DPD的评论,它帮助我们改进了论文。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/9df821b83c31/nihpp-2024.09.17.613512v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/dd5ff0c00499/nihpp-2024.09.17.613512v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/8c295e0476a3/nihpp-2024.09.17.613512v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/79de44a45402/nihpp-2024.09.17.613512v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/4bb6c021466e/nihpp-2024.09.17.613512v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/5669dd27ce06/nihpp-2024.09.17.613512v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/9df821b83c31/nihpp-2024.09.17.613512v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/dd5ff0c00499/nihpp-2024.09.17.613512v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/8c295e0476a3/nihpp-2024.09.17.613512v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/79de44a45402/nihpp-2024.09.17.613512v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/4bb6c021466e/nihpp-2024.09.17.613512v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/5669dd27ce06/nihpp-2024.09.17.613512v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63a9/11642768/9df821b83c31/nihpp-2024.09.17.613512v2-f0006.jpg

相似文献

1
On the Analysis of Protein Genetic Architecture: Response to "Protein sequence landscapes are not so simple".关于蛋白质遗传结构的分析:对《蛋白质序列格局并非如此简单》的回应
bioRxiv. 2024 Dec 6:2024.09.17.613512. doi: 10.1101/2024.09.17.613512.
2
The simplicity of protein sequence-function relationships.蛋白质序列与功能关系的简单性。
Nat Commun. 2024 Sep 11;15(1):7953. doi: 10.1038/s41467-024-51895-5.
3
Protein sequence landscapes are not so simple: on reference-free versus reference-based inference.蛋白质序列格局并非如此简单:关于无参考与基于参考的推断。
bioRxiv. 2024 Jan 30:2024.01.29.577800. doi: 10.1101/2024.01.29.577800.
4
The simplicity of protein sequence-function relationships.蛋白质序列与功能关系的简单性。
bioRxiv. 2024 Feb 7:2023.09.02.556057. doi: 10.1101/2023.09.02.556057.
5
Importance of higher-order epistasis in large protein sequence-function relationships.高阶上位性在大型蛋白质序列-功能关系中的重要性。
bioRxiv. 2024 Sep 24:2024.09.22.614318. doi: 10.1101/2024.09.22.614318.
6
Orthogonal Estimates of Variances for Additive, Dominance, and Epistatic Effects in Populations.群体中加性、显性和上位性效应方差的正交估计
Genetics. 2017 Jul;206(3):1297-1307. doi: 10.1534/genetics.116.199406. Epub 2017 May 18.
7
Epistasis facilitates functional evolution in an ancient transcription factor.上位性促进了一个古老转录因子的功能进化。
Elife. 2024 May 20;12:RP88737. doi: 10.7554/eLife.88737.
8
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
9
Dominance and epistatic genetic variances for litter size in pigs using genomic models.利用基因组模型估计猪窝产仔数的显性和上位遗传方差。
Genet Sel Evol. 2018 Dec 22;50(1):71. doi: 10.1186/s12711-018-0437-3.
10
Detecting High-Order Epistasis in Nonlinear Genotype-Phenotype Maps.检测非线性基因型-表型图谱中的高阶上位性
Genetics. 2017 Mar;205(3):1079-1088. doi: 10.1534/genetics.116.195214. Epub 2017 Jan 18.

本文引用的文献

1
The simplicity of protein sequence-function relationships.蛋白质序列与功能关系的简单性。
Nat Commun. 2024 Sep 11;15(1):7953. doi: 10.1038/s41467-024-51895-5.
2
Epistasis facilitates functional evolution in an ancient transcription factor.上位性促进了一个古老转录因子的功能进化。
Elife. 2024 May 20;12:RP88737. doi: 10.7554/eLife.88737.
3
Diversification of DNA-Binding Specificity by Permissive and Specificity-Switching Mutations in the ParB/Noc Protein Family.ParB/Noc 蛋白家族中允许和特异性转换突变对 DNA 结合特异性的多样化。
Cell Rep. 2020 Jul 21;32(3):107928. doi: 10.1016/j.celrep.2020.107928.
4
Inferring the shape of global epistasis.推断全球上位性的形状。
Proc Natl Acad Sci U S A. 2018 Aug 7;115(32):E7550-E7558. doi: 10.1073/pnas.1804015115. Epub 2018 Jul 23.
5
The Context-Dependence of Mutations: A Linkage of Formalisms.突变的上下文依赖性:形式主义的一种联系
PLoS Comput Biol. 2016 Jun 23;12(6):e1004771. doi: 10.1371/journal.pcbi.1004771. eCollection 2016 Jun.
6
Delayed commitment to evolutionary fate in antibiotic resistance fitness landscapes.抗生素耐药适应性景观中进化命运的延迟承诺
Nat Commun. 2015 Jun 10;6:7385. doi: 10.1038/ncomms8385.
7
Inferring fitness landscapes by regression produces biased estimates of epistasis.通过回归推断适应度景观会产生偏倚的上位性估计。
Proc Natl Acad Sci U S A. 2014 Jun 3;111(22):E2301-9. doi: 10.1073/pnas.1400849111. Epub 2014 May 19.
8
Darwinian evolution can follow only very few mutational paths to fitter proteins.达尔文式进化只能沿着极少数的突变路径产生更适应的蛋白质。
Science. 2006 Apr 7;312(5770):111-4. doi: 10.1126/science.1123539.