Suppr超能文献

一种蛋白质的新型数值表示:三维混沌博弈表示及其扩展自然向量。

A novel numerical representation for proteins: Three-dimensional Chaos Game Representation and its Extended Natural Vector.

作者信息

Sun Zeju, Pei Shaojun, He Rong Lucy, Yau Stephen S-T

机构信息

Department of Mathematical Sciences, Tsinghua University, Beijing, PR China.

Department of Biological Sciences, Chicago State University, Chicago, IL 60628, USA.

出版信息

Comput Struct Biotechnol J. 2020 Jul 15;18:1904-1913. doi: 10.1016/j.csbj.2020.07.004. eCollection 2020.

Abstract

Chaos Game Representation (CGR) was first proposed to be an image representation method of DNA and have been extended to the case of other biological macromolecules. Compared with the CGR images of DNA, where DNA sequences are converted into a series of points in the unit square, the existing CGR images of protein are not so elegant in geometry and the implications of the distribution of points in the CGR image are not so obvious. In this study, by naturally distributing the twenty amino acids on the vertices of a regular dodecahedron, we introduce a novel three-dimensional image representation of protein sequences with CGR method. We also associate each CGR image with a vector in high dimensional Euclidean space, called the extended natural vector (ENV), in order to analyze the information contained in the CGR images. Based on the results of protein classification and phylogenetic analysis, our method could serve as a precise method to discover biological relationships between proteins.

摘要

混沌游戏表示法(CGR)最初被提出作为一种DNA的图像表示方法,并已扩展到其他生物大分子的情况。与DNA的CGR图像不同,在DNA的CGR图像中,DNA序列被转换为单位正方形中的一系列点,现有的蛋白质CGR图像在几何形状上不那么优美,并且CGR图像中点的分布含义也不那么明显。在本研究中,通过将二十种氨基酸自然地分布在正十二面体的顶点上,我们用CGR方法引入了一种新的蛋白质序列三维图像表示。我们还将每个CGR图像与高维欧几里得空间中的一个向量相关联,称为扩展自然向量(ENV),以便分析CGR图像中包含的信息。基于蛋白质分类和系统发育分析的结果,我们的方法可以作为一种精确的方法来发现蛋白质之间的生物学关系。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d46a/7390779/f95fdddc2c42/ga1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验