College of Science, Zhejiang Sci-Tech University, Hangzhou 310018, PR China.
J Theor Biol. 2012 Jul 7;304:81-7. doi: 10.1016/j.jtbi.2012.03.023. Epub 2012 Apr 1.
Based on the order of the 6-bit binary Gray code, a cyclic order of the 20 amino acids is introduced. A novel 3D graphical representation of protein sequences is then proposed, modeled on the chaos game representation (CGR) of DNA sequences. Furthermore, a mathematical descriptor is suggested to characterize the resulting graphical representation curve. The efficiency of our approach is illustrated by comparing the similarities and dissimilarities among the ND5 protein sequences of nine different species. Using correlation and significance analysis, we compare both our results and those of other graphical representations against ClustalW's results, which shows the utility of our approach.
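The abstract names two building blocks, the 6-bit binary Gray code and the CGR of DNA sequences, without giving the details of how they are combined. As a minimal sketch of those two ingredients only, the Python below generates the standard binary-reflected Gray code and the classic 2D CGR of a DNA string. The function names `gray_code` and `cgr_points` and the corner assignment in `CORNERS` are assumptions made for illustration; the paper's actual mapping of amino acids onto the Gray code order and its 3D construction are not reproduced here.

```python
def gray_code(n_bits):
    """Return the 2**n_bits binary-reflected Gray code words as bit strings."""
    # i ^ (i >> 1) is the standard formula for the i-th reflected Gray code word.
    return [format(i ^ (i >> 1), f"0{n_bits}b") for i in range(2 ** n_bits)]


# Assumed corner convention for the unit square; other assignments are in use.
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}


def cgr_points(seq):
    """Map a DNA string to its CGR points.

    Starting from the centre of the unit square, each new point lies
    halfway between the previous point and the corner of the current base.
    """
    x, y = 0.5, 0.5
    points = []
    for base in seq:
        cx, cy = CORNERS[base]
        x, y = (x + cx) / 2, (y + cy) / 2
        points.append((x, y))
    return points


if __name__ == "__main__":
    codes = gray_code(6)
    print(len(codes), codes[:4])
    print(cgr_points("ACGT"))
```

Consecutive Gray code words differ in exactly one bit, and the last word differs from the first in one bit as well, so the sequence closes into a cycle. That property is consistent with the cyclic order mentioned in the abstract, although how the 20 amino acids are placed on that cycle is specified only in the paper itself.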