• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质中固有无序区域的氨基酸取代评分矩阵。

Amino acid substitution scoring matrices specific to intrinsically disordered regions in proteins.

机构信息

Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics, Uppal, Hyderabad, Telangana, 500039, India.

Graduate School, Manipal Academy of Higher Education, Manipal, Karnataka, 576104, India.

出版信息

Sci Rep. 2019 Nov 8;9(1):16380. doi: 10.1038/s41598-019-52532-8.

DOI:10.1038/s41598-019-52532-8
PMID:31704957
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6841959/
Abstract

An amino acid substitution scoring matrix encapsulates the rates at which various amino acid residues in proteins are substituted by other amino acid residues, over time. Database search methods make use of substitution scoring matrices to identify sequences with homologous relationships. However, widely used substitution scoring matrices, such as BLOSUM series, have been developed using aligned blocks that are mostly devoid of disordered regions in proteins. Hence, these substitution-scoring matrices are mostly inappropriate for homology searches involving proteins enriched with disordered regions as the disordered regions have distinct amino acid compositional bias, and therefore expected to have undergone amino acid substitutions that are distinct from those in the ordered regions. We, therefore, developed a novel series of substitution scoring matrices referred to as EDSSMat by exclusively considering the substitution frequencies of amino acids in the disordered regions of the eukaryotic proteins. The newly developed matrices were tested for their ability to detect homologs of proteins enriched with disordered regions by means of SSEARCH tool. The results unequivocally demonstrate that EDSSMat matrices detect more number of homologs than the widely used BLOSUM, PAM and other standard matrices, indicating their utility value for homology searches of intrinsically disordered proteins.

摘要

氨基酸取代评分矩阵概括了蛋白质中各种氨基酸残基随时间被其他氨基酸残基取代的速率。数据库搜索方法利用取代评分矩阵来识别具有同源关系的序列。然而,广泛使用的取代评分矩阵,如 BLOSUM 系列,是使用主要缺乏蛋白质中无规卷曲区域的对齐块开发的。因此,这些取代评分矩阵对于富含无规卷曲区域的蛋白质的同源性搜索不太合适,因为无规卷曲区域具有独特的氨基酸组成偏向性,因此预计会发生与有序区域不同的氨基酸取代。因此,我们专门考虑了真核蛋白质无规卷曲区域中氨基酸的取代频率,开发了一系列新的取代评分矩阵,称为 EDSSMat。我们使用 SSEARCH 工具测试了新开发的矩阵检测富含无规卷曲区域的蛋白质同源物的能力。结果明确表明,EDSSMat 矩阵比广泛使用的 BLOSUM、PAM 和其他标准矩阵检测到更多数量的同源物,这表明它们对固有无序蛋白质的同源性搜索具有实用价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/bd0268d7cc7f/41598_2019_52532_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/70c6047f0e0f/41598_2019_52532_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/9c685871f1c7/41598_2019_52532_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/29c0b9103621/41598_2019_52532_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/7b070371b016/41598_2019_52532_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/5bbaf5a107bd/41598_2019_52532_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/bd0268d7cc7f/41598_2019_52532_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/70c6047f0e0f/41598_2019_52532_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/9c685871f1c7/41598_2019_52532_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/29c0b9103621/41598_2019_52532_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/7b070371b016/41598_2019_52532_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/5bbaf5a107bd/41598_2019_52532_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/902a/6841959/bd0268d7cc7f/41598_2019_52532_Fig6_HTML.jpg

相似文献

1
Amino acid substitution scoring matrices specific to intrinsically disordered regions in proteins.蛋白质中固有无序区域的氨基酸取代评分矩阵。
Sci Rep. 2019 Nov 8;9(1):16380. doi: 10.1038/s41598-019-52532-8.
2
Substitution scoring matrices for proteins - An overview.蛋白质替换评分矩阵——概述。
Protein Sci. 2020 Nov;29(11):2150-2163. doi: 10.1002/pro.3954. Epub 2020 Oct 12.
3
An amino acid substitution matrix for protein conformation identification.用于蛋白质构象识别的氨基酸替换矩阵。
J Bioinform Comput Biol. 2006 Jun;4(3):769-82. doi: 10.1142/s0219720006002156.
4
A transition probability model for amino acid substitutions from blocks.一种基于模块的氨基酸替换转换概率模型。
J Comput Biol. 2003;10(6):997-1010. doi: 10.1089/106652703322756195.
5
Selecting the Right Similarity-Scoring Matrix.选择合适的相似性评分矩阵。
Curr Protoc Bioinformatics. 2013;43:3.5.1-3.5.9. doi: 10.1002/0471250953.bi0305s43.
6
Adjusting scoring matrices to correct overextended alignments.调整评分矩阵以纠正过度延伸的比对。
Bioinformatics. 2013 Dec 1;29(23):3007-13. doi: 10.1093/bioinformatics/btt517. Epub 2013 Aug 31.
7
Robust sequence alignment using evolutionary rates coupled with an amino acid substitution matrix.使用进化速率结合氨基酸替换矩阵进行稳健的序列比对。
BMC Bioinformatics. 2015 Aug 14;16:255. doi: 10.1186/s12859-015-0688-8.
8
GPCRtm: An amino acid substitution matrix for the transmembrane region of class A G Protein-Coupled Receptors.GPCRtm:一种用于A类G蛋白偶联受体跨膜区的氨基酸替换矩阵。
BMC Bioinformatics. 2015 Jul 2;16:206. doi: 10.1186/s12859-015-0639-4.
9
Optimizing amino acid substitution matrices with a local alignment kernel.使用局部比对核优化氨基酸替换矩阵。
BMC Bioinformatics. 2006 May 5;7:246. doi: 10.1186/1471-2105-7-246.
10
Context-specific amino acid substitution matrices and their use in the detection of protein homologs.特定上下文氨基酸替换矩阵及其在蛋白质同源物检测中的应用。
Proteins. 2008 May 1;71(2):910-9. doi: 10.1002/prot.21775.

引用本文的文献

1
PairK: Pairwise k-mer alignment for quantifying protein motif conservation in disordered regions.PairK:用于量化无序区域中蛋白质基序保守性的成对k-mer比对
Protein Sci. 2025 Jan;34(1):e70004. doi: 10.1002/pro.70004.
2
tcrBLOSUM: an amino acid substitution matrix for sensitive alignment of distant epitope-specific TCRs.tcrBLOSUM:一种氨基酸替换矩阵,用于灵敏比对远距离表位特异性 TCR。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae602.
3
SHARK enables sensitive detection of evolutionary homologs and functional analogs in unalignable and disordered sequences.

本文引用的文献

1
Alignment Modulates Ancestral Sequence Reconstruction Accuracy.比对方式调节祖先序列重建准确性。
Mol Biol Evol. 2018 Jul 1;35(7):1783-1797. doi: 10.1093/molbev/msy055.
2
UniProt: the universal protein knowledgebase.通用蛋白质知识库:UniProt
Nucleic Acids Res. 2018 Mar 16;46(5):2699. doi: 10.1093/nar/gky092.
3
Evolution of intrinsic disorder in eukaryotic proteins.真核生物蛋白质内在无序性的演变
SHARK 能够在不可比对和无序序列中灵敏地检测进化同源物和功能类似物。
Proc Natl Acad Sci U S A. 2024 Oct 15;121(42):e2401622121. doi: 10.1073/pnas.2401622121. Epub 2024 Oct 9.
4
Sex chromosome-encoded protein homologs: current progress and open questions.性染色体编码蛋白同源物:当前的进展和未解决的问题。
Nat Struct Mol Biol. 2024 Aug;31(8):1156-1166. doi: 10.1038/s41594-024-01362-y. Epub 2024 Aug 9.
5
PairK: Pairwise k-mer alignment for quantifying protein motif conservation in disordered regions.PairK:用于量化无序区域中蛋白质基序保守性的成对k-mer比对
bioRxiv. 2024 Jul 24:2024.07.23.604860. doi: 10.1101/2024.07.23.604860.
6
Engineering G protein-coupled receptors for stabilization.工程化稳定化 G 蛋白偶联受体。
Protein Sci. 2024 Jun;33(6):e5000. doi: 10.1002/pro.5000.
7
The Evolution of Temperature and Desiccation-Related Protein Families in Tardigrada Reveals a Complex Acquisition of Extremotolerance.缓步动物门中温度和干燥相关蛋白家族的进化揭示了其对极端环境耐受性的复杂获得过程。
Genome Biol Evol. 2024 Jan 5;16(1). doi: 10.1093/gbe/evad217.
8
New alignment method for remote protein sequences by the direct use of pairwise sequence correlations and substitutions.通过直接利用成对序列相关性和替换来对远程蛋白质序列进行新的比对方法。
Front Bioinform. 2023 Oct 12;3:1227193. doi: 10.3389/fbinf.2023.1227193. eCollection 2023.
9
CapsNet-MHC predicts peptide-MHC class I binding based on capsule neural networks.CapsNet-MHC 基于胶囊神经网络预测肽-MHC I 类结合。
Commun Biol. 2023 May 5;6(1):492. doi: 10.1038/s42003-023-04867-2.
10
Intrinsically Disordered Proteins: An Overview.无规卷曲蛋白:概述。
Int J Mol Sci. 2022 Nov 14;23(22):14050. doi: 10.3390/ijms232214050.
Cell Mol Life Sci. 2017 Sep;74(17):3163-3174. doi: 10.1007/s00018-017-2559-0. Epub 2017 Jun 8.
4
What's in a name? Why these proteins are intrinsically disordered: Why these proteins are intrinsically disordered.名字里有什么?为什么这些蛋白质是内在无序的:为什么这些蛋白质是内在无序的。
Intrinsically Disord Proteins. 2013 Apr 1;1(1):e24157. doi: 10.4161/idp.24157. eCollection 2013 Jan-Dec.
5
The Nuanced Interplay of Intrinsic Disorder and Other Structural Properties Driving Protein Evolution.内在无序与其他结构特性共同作用驱动蛋白质进化。
Mol Biol Evol. 2016 Sep;33(9):2248-56. doi: 10.1093/molbev/msw092. Epub 2016 May 5.
6
Addressing inaccuracies in BLOSUM computation improves homology search performance.解决BLOSUM计算中的不准确问题可提高同源性搜索性能。
BMC Bioinformatics. 2016 Apr 27;17:189. doi: 10.1186/s12859-016-1060-3.
7
KMAD: knowledge-based multiple sequence alignment for intrinsically disordered proteins.KMAD:用于内在无序蛋白质的基于知识的多序列比对
Bioinformatics. 2016 Mar 15;32(6):932-6. doi: 10.1093/bioinformatics/btv663. Epub 2015 Nov 14.
8
UniProtKB/Swiss-Prot, the Manually Annotated Section of the UniProt KnowledgeBase: How to Use the Entry View.UniProtKB/Swiss-Prot,即UniProt知识库的人工注释部分:如何使用条目视图。
Methods Mol Biol. 2016;1374:23-54. doi: 10.1007/978-1-4939-3167-5_2.
9
Parameterized BLOSUM Matrices for Protein Alignment.用于蛋白质比对的参数化BLOSUM矩阵
IEEE/ACM Trans Comput Biol Bioinform. 2015 May-Jun;12(3):686-94. doi: 10.1109/TCBB.2014.2366126.
10
Comprehensive large-scale assessment of intrinsic protein disorder.对内在蛋白质无序状态的全面大规模评估。
Bioinformatics. 2015 Jan 15;31(2):201-8. doi: 10.1093/bioinformatics/btu625. Epub 2014 Sep 21.