Pleissner K P, Wernisch L, Oswald H, Fleck E
Department of Internal Medicine/Cardiology, Virchow-Klinikum of the Humboldt University and German Heart Institute Berlin.
Electrophoresis. 1997 Dec;18(15):2709-13. doi: 10.1002/elps.1150181504.
An algorithm for the representation of amino acid sequences as two-dimensional point patterns (2-D plot) is described. The algorithm is based on chaos game representation (CGR) for DNA sequences and was extended for amino acid sequences. The 2-D plot depicts the sequentiality of amino acids and the amino acid composition of a protein. Changes in a protein sequence as insertion, deletion and repeats of amino acids are characterized by specific geometrical properties and changes in the 2-D plots. The 2-D plot may be considered as a two-dimensional "fingerprint" of a protein. The properties of the algorithm are explained by user-defined amino acid sequences. As an example the 2-D plots of two selected heart proteins are generated. The sequences of these proteins are obtained from the protein sequence database SWISS-PROT.
描述了一种将氨基酸序列表示为二维点模式(二维图)的算法。该算法基于DNA序列的混沌游戏表示(CGR),并扩展到氨基酸序列。二维图描绘了氨基酸的顺序性和蛋白质的氨基酸组成。蛋白质序列中氨基酸的插入、缺失和重复等变化通过二维图的特定几何特性和变化来表征。二维图可被视为蛋白质的二维“指纹”。通过用户定义的氨基酸序列来解释该算法的特性。作为示例,生成了两种选定心脏蛋白的二维图。这些蛋白质的序列来自蛋白质序列数据库SWISS-PROT。