Representation of proteins as walks in 20-D space.

Author information

Novic M, Randic M

Affiliation

National Institute of Chemistry, Hajdrihova, Ljubljana, Slovenia.

Publication information

SAR QSAR Environ Res. 2008 Apr-Jun;19(3-4):317-37. doi: 10.1080/10629360802085066.

Abstract

A novel representation of proteins was introduced. It is independent of arbitrary decisions about which labels to assign to the 20 natural amino acids. The approach assigns 20 unit vectors in a 20-dimensional vector space to the 20 natural amino acids. A protein is then represented by a walk, that is, a sequence of steps in the 20-dimensional space, analogous to a walk in the (x, y) plane in the case of binary strings. A straightforward numerical characterization of a protein is obtained from the distance matrix associated with its walk in 20-dimensional space, which combines information on the Euclidean distances between the various amino acids in the protein sequence. The Line Distance matrix offers an additional numerical characterization of proteins, while the lengths of the steps of the walk in 20-D space allow construction of a "protein profile," which represents the distribution of the average lengths of the steps and their powers.
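
The walk and its distance matrix can be illustrated with a minimal Python sketch. The abstract does not give the construction in detail, so the following is an assumption rather than the authors' procedure: each amino acid is mapped to one coordinate axis (a one-hot unit vector), the walk is the running sum of those unit steps, and the distance matrix holds the Euclidean distances between all pairs of points on the walk. The names `AMINO_ACIDS`, `walk_points`, and `distance_matrix` are hypothetical, and the ordering of the one-letter codes is arbitrary.

```python
import numpy as np

# One-letter codes of the 20 natural amino acids. The ordering is arbitrary;
# it only decides which coordinate axis each amino acid is mapped to.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def walk_points(sequence):
    """Return the points visited by the walk, one row per residue.

    Assumption: each residue contributes a unit step along the axis of its
    amino acid, and the walk is the cumulative sum of those steps.
    """
    axis_of = {aa: i for i, aa in enumerate(AMINO_ACIDS)}
    steps = np.zeros((len(sequence), len(AMINO_ACIDS)))
    for k, aa in enumerate(sequence):
        steps[k, axis_of[aa]] = 1.0
    return np.cumsum(steps, axis=0)


def distance_matrix(points):
    """Return the Euclidean distances between all pairs of walk points."""
    diff = points[:, None, :] - points[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


if __name__ == "__main__":
    pts = walk_points("AAC")
    print(distance_matrix(pts))
```

For the toy sequence `AAC`, the first and third points of the walk differ by one step along the A axis and one step along the C axis, so under these assumptions their distance is the square root of 2. Because every amino acid gets its own axis, the result does not depend on any numerical labels assigned to the amino acids, only on which axis is which.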
