• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

来自人类特异性氨基酸替换矩阵的进化与功能启示。

Evolutionary and functional lessons from human-specific amino acid substitution matrices.

作者信息

Shauli Tair, Brandes Nadav, Linial Michal

机构信息

The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, 91904, Israel.

Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, 91904, Israel.

出版信息

NAR Genom Bioinform. 2021 Sep 16;3(3):lqab079. doi: 10.1093/nargab/lqab079. eCollection 2021 Sep.

DOI:10.1093/nargab/lqab079
PMID:34541526
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8445205/
Abstract

Human genetic variation in coding regions is fundamental to the study of protein structure and function. Most methods for interpreting missense variants consider substitution measures derived from homologous proteins across different species. In this study, we introduce human-specific amino acid (AA) substitution matrices that are based on genetic variations in the modern human population. We analyzed the frequencies of >4.8M single nucleotide variants (SNVs) at codon and AA resolution and compiled human-centric substitution matrices that are fundamentally different from classic cross-species matrices (e.g. BLOSUM, PAM). Our matrices are asymmetric, with some AA replacements showing significant directional preference. Moreover, these AA matrices are only partly predicted by nucleotide substitution rates. We further test the utility of our matrices in exposing functional signals of experimentally-validated protein annotations. A significant reduction in AA transition frequencies was observed across nine post-translational modification (PTM) types and four ion-binding sites. Our results propose a purifying selection signal in the human proteome across a diverse set of functional protein annotations and provide an empirical baseline for interpreting human genetic variation in coding regions.

摘要

编码区域的人类遗传变异是蛋白质结构与功能研究的基础。大多数解释错义变体的方法都考虑了来自不同物种同源蛋白质的替换度量。在本研究中,我们引入了基于现代人类群体遗传变异的人类特异性氨基酸(AA)替换矩阵。我们在密码子和氨基酸分辨率下分析了超过480万个单核苷酸变体(SNV)的频率,并编制了以人类为中心的替换矩阵,这些矩阵与经典的跨物种矩阵(如BLOSUM、PAM)有根本区别。我们的矩阵是不对称的,一些氨基酸替换显示出明显的方向偏好。此外,这些氨基酸矩阵仅部分由核苷酸替换率预测。我们进一步测试了我们的矩阵在揭示经实验验证的蛋白质注释的功能信号方面的效用。在九种翻译后修饰(PTM)类型和四个离子结合位点中,观察到氨基酸转换频率显著降低。我们的结果提出了人类蛋白质组中跨多种功能蛋白质注释的纯化选择信号,并为解释编码区域的人类遗传变异提供了一个经验基线。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd7/8445205/3782cc77ab72/lqab079fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd7/8445205/dc4990d7261b/lqab079fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd7/8445205/6cd2afdaca27/lqab079fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd7/8445205/94208beb759d/lqab079fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd7/8445205/26778594ba72/lqab079fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd7/8445205/3782cc77ab72/lqab079fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd7/8445205/dc4990d7261b/lqab079fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd7/8445205/6cd2afdaca27/lqab079fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd7/8445205/94208beb759d/lqab079fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd7/8445205/26778594ba72/lqab079fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd7/8445205/3782cc77ab72/lqab079fig5.jpg

相似文献

1
Evolutionary and functional lessons from human-specific amino acid substitution matrices.来自人类特异性氨基酸替换矩阵的进化与功能启示。
NAR Genom Bioinform. 2021 Sep 16;3(3):lqab079. doi: 10.1093/nargab/lqab079. eCollection 2021 Sep.
2
Selective constraints on amino acids estimated by a mechanistic codon substitution model with multiple nucleotide changes.通过具有多个核苷酸变化的机制密码子替换模型估计的氨基酸的选择约束。
PLoS One. 2011 Mar 18;6(3):e17244. doi: 10.1371/journal.pone.0017244.
3
Comparison of the PAM and BLOSUM Amino Acid Substitution Matrices.PAM与BLOSUM氨基酸替换矩阵的比较。
CSH Protoc. 2008 Jun 1;2008:pdb.ip59. doi: 10.1101/pdb.ip59.
4
A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices.蛋白质结构进化的热力学模型解释了经验性的氨基酸替换矩阵。
Protein Sci. 2021 Oct;30(10):2057-2068. doi: 10.1002/pro.4155. Epub 2021 Jul 30.
5
The ranging of amino acids substitution matrices of various types in accordance with the alignment accuracy criterion.根据比对准确性标准对各种类型氨基酸替换矩阵进行排序。
BMC Bioinformatics. 2020 Sep 14;21(Suppl 11):294. doi: 10.1186/s12859-020-03616-0.
6
Advantages of a mechanistic codon substitution model for evolutionary analysis of protein-coding sequences.机械密码子替换模型在蛋白质编码序列进化分析中的优势。
PLoS One. 2011;6(12):e28892. doi: 10.1371/journal.pone.0028892. Epub 2011 Dec 29.
7
Superiority of a mechanistic codon substitution model even for protein sequences in phylogenetic analysis.在系统发育分析中,机械密码子替换模型的优越性甚至体现在蛋白质序列上。
BMC Evol Biol. 2013 Nov 21;13:257. doi: 10.1186/1471-2148-13-257.
8
Substitution scoring matrices for proteins - An overview.蛋白质替换评分矩阵——概述。
Protein Sci. 2020 Nov;29(11):2150-2163. doi: 10.1002/pro.3954. Epub 2020 Oct 12.
9
New algorithm for the analysis of nucleotide and amino acid evolutionary relationships based on Klein four-group.基于 Klein 四组的核苷酸和氨基酸进化关系分析新算法。
Biosystems. 2023 Nov;233:105030. doi: 10.1016/j.biosystems.2023.105030. Epub 2023 Sep 17.
10
Evolutionary constraint and disease associations of post-translational modification sites in human genomes.人类基因组中翻译后修饰位点的进化限制与疾病关联
PLoS Genet. 2015 Jan 22;11(1):e1004919. doi: 10.1371/journal.pgen.1004919. eCollection 2015 Jan.

引用本文的文献

1
Language Modelling Techniques for Analysing the Impact of Human Genetic Variation.用于分析人类基因变异影响的语言建模技术
Bioinform Biol Insights. 2025 Sep 2;19:11779322251358314. doi: 10.1177/11779322251358314. eCollection 2025.
2
PRP: pathogenic risk prediction for rare nonsynonymous single nucleotide variants.PRP:罕见非同义单核苷酸变异的致病风险预测
Hum Genet. 2025 May 29. doi: 10.1007/s00439-025-02751-z.
3
Interpreting protein variant effects with computational predictors and deep mutational scanning.用计算预测器和深度突变扫描来解释蛋白质变异的影响。

本文引用的文献

1
Serine substitutions are linked to codon usage and differ for variable and conserved protein regions.丝氨酸取代与密码子使用有关,并且在可变和保守的蛋白质区域中存在差异。
Sci Rep. 2019 Nov 21;9(1):17238. doi: 10.1038/s41598-019-53452-3.
2
Quantifying gene selection in cancer through protein functional alteration bias.通过蛋白质功能改变偏向量化癌症中的基因选择。
Nucleic Acids Res. 2019 Jul 26;47(13):6642-6655. doi: 10.1093/nar/gkz546.
3
CADD: predicting the deleteriousness of variants throughout the human genome.CADD:预测整个人类基因组中变异的有害性。
Dis Model Mech. 2022 Jun 1;15(6). doi: 10.1242/dmm.049510. Epub 2022 Jun 23.
Nucleic Acids Res. 2019 Jan 8;47(D1):D886-D894. doi: 10.1093/nar/gky1016.
4
A review study: Computational techniques for expecting the impact of non-synonymous single nucleotide variants in human diseases.一篇综述研究:计算技术在预测人类疾病中非 synonymous 单核苷酸变异影响方面的应用。
Gene. 2019 Jan 5;680:20-33. doi: 10.1016/j.gene.2018.09.028. Epub 2018 Sep 18.
5
AWESOME: a database of SNPs that affect protein post-translational modifications.AWESOME:一个影响蛋白质翻译后修饰的 SNP 数据库。
Nucleic Acids Res. 2019 Jan 8;47(D1):D874-D880. doi: 10.1093/nar/gky821.
6
Secondary structure forming sequences drive SD-MMEJ repair of DNA double-strand breaks.二级结构形成序列驱动DNA双链断裂的SD-MMEJ修复。
Nucleic Acids Res. 2017 Dec 15;45(22):12848-12861. doi: 10.1093/nar/gkx1056.
7
Molecular Population Genetics.分子群体遗传学
Genetics. 2017 Mar;205(3):1003-1035. doi: 10.1534/genetics.116.196493.
8
From structure to redox: The diverse functional roles of disulfides and implications in disease.从结构到氧化还原:二硫键的多种功能作用及其在疾病中的意义
Proteomics. 2017 Mar;17(6). doi: 10.1002/pmic.201600391.
9
Variation in the molecular clock of primates.灵长类动物分子钟的变异。
Proc Natl Acad Sci U S A. 2016 Sep 20;113(38):10607-12. doi: 10.1073/pnas.1600374113. Epub 2016 Sep 6.
10
Analysis of protein-coding genetic variation in 60,706 humans.对60706名人类的蛋白质编码基因变异进行分析。
Nature. 2016 Aug 18;536(7616):285-91. doi: 10.1038/nature19057.