Li Chun, Xing Lili, Wang Xin
Department of Mathematics, Bohai University, Jinzhou, PR China.
BMB Rep. 2008 Mar 31;41(3):217-22. doi: 10.5483/bmbrep.2008.41.3.217.
Based on a five-letter model of the 20 amino acids, we propose a new 2-D graphical representation of protein sequence. Then we transform the 2-D graphical representation into a numerical characterization that will facilitate quantitative comparisons of protein sequences. As an application, we construct the phylogenetic tree of 56 coronavirus spike proteins. The resulting tree agrees well with the established taxonomic groups.
基于20种氨基酸的五字母模型,我们提出了一种新的蛋白质序列二维图形表示法。然后,我们将二维图形表示法转换为数值特征,这将有助于对蛋白质序列进行定量比较。作为应用,我们构建了56种冠状病毒刺突蛋白的系统发育树。所得树与既定的分类群非常吻合。