Randic M, Novic M, Vracko M
National Institute of Chemistry, Ljubljana, Slovenia.
SAR QSAR Environ Res. 2008 Apr-Jun;19(3-4):339-49. doi: 10.1080/10629360802085082.
A novel characterization of proteins is presented, based on selected properties of the recently introduced 20 × 20 amino acid adjacency matrix, whose elements count the occurrences of all 400 possible pairwise adjacencies obtained by reading the protein's primary sequence from left to right. In particular, we consider the characterization based on the sum and the difference of the rows and the corresponding columns, which characterizes each protein by a pair of 20-component vectors. The approach is illustrated on a set of ND6 proteins from eight species.
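As a rough illustration of the construction described in the abstract, the following Python sketch builds a 20 × 20 adjacency matrix from a sequence and derives two 20-component vectors from it. Several details are assumptions, not taken from the paper: the alphabetical ordering of the one-letter amino acid codes, the function names `adjacency_matrix` and `sum_and_difference_vectors`, the decision to skip pairs containing non-standard residue codes, and the reading of "sum and difference of the rows and the corresponding columns" as the row total plus or minus the column total for each amino acid. The authors' exact definition may differ. The sequence used is a made-up toy string, not an ND6 protein.

```python
# Assumed ordering of the 20 standard amino acids (one-letter codes, alphabetical).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def adjacency_matrix(sequence):
    """Count ordered adjacent pairs (left residue, right residue) in a sequence.

    Entry [i][j] is the number of times amino acid i is immediately
    followed by amino acid j when the sequence is read left to right.
    Pairs containing a character outside AMINO_ACIDS are skipped (assumption).
    """
    n = len(AMINO_ACIDS)
    matrix = [[0] * n for _ in range(n)]
    for left, right in zip(sequence, sequence[1:]):
        if left in INDEX and right in INDEX:
            matrix[INDEX[left]][INDEX[right]] += 1
    return matrix


def sum_and_difference_vectors(matrix):
    """Return two 20-component vectors under one possible interpretation:

    sums[k]  = (total of row k) + (total of column k)
    diffs[k] = (total of row k) - (total of column k)
    """
    n = len(matrix)
    row_totals = [sum(row) for row in matrix]
    col_totals = [sum(matrix[i][j] for i in range(n)) for j in range(n)]
    sums = [r + c for r, c in zip(row_totals, col_totals)]
    diffs = [r - c for r, c in zip(row_totals, col_totals)]
    return sums, diffs


if __name__ == "__main__":
    toy_sequence = "MKVLAAGM"  # hypothetical toy sequence for illustration only
    m = adjacency_matrix(toy_sequence)
    sums, diffs = sum_and_difference_vectors(m)
    for aa, s, d in zip(AMINO_ACIDS, sums, diffs):
        if s or d:
            print(aa, s, d)
```

Plain nested lists are used so that the sketch has no dependencies; with real protein sequences an array library would be a natural substitute.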