• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于蛋白质折叠识别的简化氨基酸字母表及其对折叠的影响。

Simplified amino acid alphabets for protein fold recognition and implications for folding.

作者信息

Murphy L R, Wallqvist A, Levy R M

机构信息

Department of Chemistry, Rutgers University, Wright-Rieman Laboratories, 610 Taylor Road, Piscataway, NJ 08854-8087, USA.

出版信息

Protein Eng. 2000 Mar;13(3):149-52. doi: 10.1093/protein/13.3.149.

DOI:10.1093/protein/13.3.149
PMID:10775656
Abstract

Protein design experiments have shown that the use of specific subsets of amino acids can produce foldable proteins. This prompts the question of whether there is a minimal amino acid alphabet which could be used to fold all proteins. In this work we make an analogy between sequence patterns which produce foldable sequences and those which make it possible to detect structural homologs by aligning sequences, and use it to suggest the possible size of such a reduced alphabet. We estimate that reduced alphabets containing 10-12 letters can be used to design foldable sequences for a large number of protein families. This estimate is based on the observation that there is little loss of the information necessary to pick out structural homologs in a clustered protein sequence database when a suitable reduction of the amino acid alphabet from 20 to 10 letters is made, but that this information is rapidly degraded when further reductions in the alphabet are made.

摘要

蛋白质设计实验表明,使用特定的氨基酸子集可以产生可折叠的蛋白质。这就引发了一个问题:是否存在一个最小的氨基酸字母表,可用于折叠所有蛋白质。在这项工作中,我们将产生可折叠序列的序列模式与通过序列比对来检测结构同源物的序列模式进行类比,并利用它来推测这种简化字母表可能的大小。我们估计,包含10至12个字母的简化字母表可用于为大量蛋白质家族设计可折叠序列。这一估计基于这样的观察结果:当将氨基酸字母表从20个适当减少到10个字母时,在聚类蛋白质序列数据库中挑选结构同源物所需的信息几乎没有损失,但当进一步减少字母表时,这些信息会迅速退化。

相似文献

1
Simplified amino acid alphabets for protein fold recognition and implications for folding.用于蛋白质折叠识别的简化氨基酸字母表及其对折叠的影响。
Protein Eng. 2000 Mar;13(3):149-52. doi: 10.1093/protein/13.3.149.
2
Accuracy of sequence alignment and fold assessment using reduced amino acid alphabets.使用简化氨基酸字母表进行序列比对和折叠评估的准确性。
Proteins. 2006 Jun 1;63(4):986-95. doi: 10.1002/prot.20881.
3
Reduced alphabet for protein folding prediction.用于蛋白质折叠预测的简化字母表。
Proteins. 2015 Apr;83(4):631-9. doi: 10.1002/prot.24762. Epub 2015 Feb 5.
4
Amino acid alphabet reduction preserves fold information contained in contact interactions in proteins.氨基酸字母表精简保留了蛋白质中接触相互作用所包含的折叠信息。
Proteins. 2015 Dec;83(12):2198-216. doi: 10.1002/prot.24936.
5
Reduced amino acid alphabets exhibit an improved sensitivity and selectivity in fold assignment.简化氨基酸字母表在折叠分配中表现出更高的灵敏度和选择性。
Bioinformatics. 2009 Jun 1;25(11):1356-62. doi: 10.1093/bioinformatics/btp164. Epub 2009 Apr 7.
6
What is the minimum number of letters required to fold a protein?折叠一个蛋白质所需的最少字母数是多少?
J Mol Biol. 2003 May 9;328(4):921-6. doi: 10.1016/s0022-2836(03)00324-3.
7
Grouping of amino acids and recognition of protein structurally conserved regions by reduced alphabets of amino acids.通过氨基酸简化字母表对氨基酸进行分组并识别蛋白质结构保守区域。
Sci China C Life Sci. 2007 Jun;50(3):392-402. doi: 10.1007/s11427-007-0023-3.
8
Reduced alphabet of prebiotic amino acids optimally encodes the conformational space of diverse extant protein folds.简化的前生物氨基酸字母表最优地编码了不同现存蛋白质折叠的构象空间。
BMC Evol Biol. 2019 Jul 30;19(1):158. doi: 10.1186/s12862-019-1464-6.
9
Automated alphabet reduction for protein datasets.蛋白质数据集的自动字母缩减
BMC Bioinformatics. 2009 Jan 6;10:6. doi: 10.1186/1471-2105-10-6.
10
Reduction of protein sequence complexity by residue grouping.通过残基分组降低蛋白质序列复杂性。
Protein Eng. 2003 May;16(5):323-30. doi: 10.1093/protein/gzg044.

引用本文的文献

1
TetRex: a novel algorithm for index-accelerated search of highly conserved motifs.TetRex:一种用于高度保守基序索引加速搜索的新算法。
NAR Genom Bioinform. 2025 Apr 17;7(2):lqaf039. doi: 10.1093/nargab/lqaf039. eCollection 2025 Jun.
2
Accelerated enzyme engineering by machine-learning guided cell-free expression.通过机器学习引导的无细胞表达实现加速酶工程。
Nat Commun. 2025 Jan 20;16(1):865. doi: 10.1038/s41467-024-55399-0.
3
BAD2matrix: Phylogenomic matrix concatenation, indel coding, and more.BAD2矩阵:系统发育基因组矩阵拼接、插入缺失编码及更多内容。
Appl Plant Sci. 2024 Sep 24;12(6):e11604. doi: 10.1002/aps3.11604. eCollection 2024 Nov-Dec.
4
Limits on inferring T cell specificity from partial information.从部分信息推断 T 细胞特异性的限制。
Proc Natl Acad Sci U S A. 2024 Oct 15;121(42):e2408696121. doi: 10.1073/pnas.2408696121. Epub 2024 Oct 7.
5
Discovery of antimicrobial peptides in the global microbiome with machine learning.利用机器学习在全球微生物组中发现抗菌肽。
Cell. 2024 Jul 11;187(14):3761-3778.e16. doi: 10.1016/j.cell.2024.05.013. Epub 2024 Jun 5.
6
Lambda3: homology search for protein, nucleotide, and bisulfite-converted sequences.Lambda3:蛋白质、核苷酸和亚硫酸氢盐转化序列的同源性搜索。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae097.
7
Protein language models meet reduced amino acid alphabets.蛋白质语言模型与简化的氨基酸字母表相遇。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae061.
8
Computational exploration of the global microbiome for antibiotic discovery.用于抗生素发现的全球微生物组的计算探索。
bioRxiv. 2023 Sep 11:2023.08.31.555663. doi: 10.1101/2023.08.31.555663.
9
Embedded-AMP: A Multi-Thread Computational Method for the Systematic Identification of Antimicrobial Peptides Embedded in Proteome Sequences.嵌入式抗菌肽:一种用于系统鉴定蛋白质组序列中嵌入式抗菌肽的多线程计算方法。
Antibiotics (Basel). 2023 Jan 10;12(1):139. doi: 10.3390/antibiotics12010139.
10
Organizing the bacterial annotation space with amino acid sequence embeddings.利用氨基酸序列嵌入来组织细菌注释空间。
BMC Bioinformatics. 2022 Sep 23;23(1):385. doi: 10.1186/s12859-022-04930-5.