F Souza Luryane, B de B Pereira Hernane, M da Rocha Filho Tarcisio, A S Machado Bruna, A Moret Marcelo
Centro de Ciências Exatas e das Tecnologias, Universidade Federal do Oeste da Bahia, Barreiras, Bahia, Brazil.
Programa de Modelagem Computacional e Tecnologia Industrial, SENAI-CIMATEC, Salvador, Bahia, Brazil.
PLoS One. 2023 Oct 5;18(10):e0287880. doi: 10.1371/journal.pone.0287880. eCollection 2023.
One of the first steps in protein sequence analysis is comparing sequences to look for similarities. We propose an information theoretical distance to compare cellular automata representing protein sequences, and determine similarities. Our approach relies in a stationary Hamming distance for the evolution of the automata according to a properly chosen rule, and to build a pairwise similarity matrix and determine common ancestors among different species in a simpler and less computationally demanding computer codes when compared to other methods.
蛋白质序列分析的首要步骤之一是比较序列以寻找相似性。我们提出一种信息理论距离来比较代表蛋白质序列的细胞自动机,并确定相似性。我们的方法依赖于根据适当选择的规则为自动机的演化确定一个固定的汉明距离,以构建成对相似性矩阵,并在与其他方法相比时,用更简单且计算要求更低的计算机代码确定不同物种之间的共同祖先。