Suppr超能文献

AAindexNC:估算非标准氨基酸的物理化学性质,包括那些源自蛋白质数据库(PDB)和蛋白质数据银行化学数据库(PDBeChem)的非标准氨基酸。

AAindexNC: Estimating the Physicochemical Properties of Non-Canonical Amino Acids, Including Those Derived from the PDB and PDBeChem Databank.

作者信息

Milchevskiy Yury V, Kravatskaya Galina I, Kravatsky Yury V

机构信息

Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov Str., 32, 119991 Moscow, Russia.

Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov Str., 32, 119991 Moscow, Russia.

出版信息

Int J Mol Sci. 2024 Nov 22;25(23):12555. doi: 10.3390/ijms252312555.

Abstract

The physicochemical properties of amino acid residues from the AAindex database are widely used as predictors in building models for predicting both protein structures and properties. It should be noted, however, that the AAindex database contains data only for the 20 canonical amino acids. Non-canonical amino acids, while less common, are not rare; the Protein Data Bank includes proteins with more than 1000 distinct non-canonical amino acids. In this study, we propose a method to evaluate the physicochemical properties from the AAindex database for non-canonical amino acids and assess the prediction quality. We implemented our method as a bioinformatics tool and estimated the physicochemical properties of non-canonical amino acids from the PDB with the chemical composition presentation using SMILES encoding obtained from the PDBechem databank. The bioinformatics tool and resulting database of the estimated properties are freely available on the author's website and available for download via GitHub.

摘要

来自AAindex数据库的氨基酸残基的物理化学性质被广泛用作构建预测蛋白质结构和性质模型的预测因子。然而,应该注意的是,AAindex数据库仅包含20种标准氨基酸的数据。非标准氨基酸虽然不太常见,但并不罕见;蛋白质数据库中包含具有1000多种不同非标准氨基酸的蛋白质。在本研究中,我们提出了一种方法来评估来自AAindex数据库的非标准氨基酸的物理化学性质,并评估预测质量。我们将我们的方法实现为一种生物信息学工具,并使用从PDBechem数据库获得的SMILES编码,通过化学成分表示法估计了PDB中非标准氨基酸的物理化学性质。该生物信息学工具和由此产生的估计性质数据库可在作者网站上免费获取,并可通过GitHub下载。

相似文献

2
AAindex: amino acid index database, progress report 2008.AAindex:氨基酸索引数据库,2008年进展报告。
Nucleic Acids Res. 2008 Jan;36(Database issue):D202-5. doi: 10.1093/nar/gkm998. Epub 2007 Nov 12.
8
Intrinsic disorder in the Protein Data Bank.蛋白质数据库中的内在无序状态。
J Biomol Struct Dyn. 2007 Feb;24(4):325-42. doi: 10.1080/07391102.2007.10507123.

本文引用的文献

7
Robust genetic codes enhance protein evolvability.稳健的遗传密码增强了蛋白质的可进化性。
PLoS Biol. 2024 May 16;22(5):e3002594. doi: 10.1371/journal.pbio.3002594. eCollection 2024 May.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验