Department of Biochemistry and Molecular Biology, University of Bari, Italy.
Peptides. 2010 May;31(5):983-8. doi: 10.1016/j.peptides.2010.02.003. Epub 2010 Feb 11.
Discovering the informational rule(s) underlying structure-function relationships in the protein language is at the core of biology. Current theories have proven inadequate to explain the origins of biological information such as that found in nucleotide and amino acid sequences. Here, we demonstrate that the information content of an amino acid motif correlates with the motif rarity. A structured analysis of the scientific literature supports the theory that rare pentapeptide words have higher significance than more common pentapeptides in biological cell 'talk'. This study expands on our previous research showing that the immunological information contained in an amino acid sequence is inversely related to the sequence frequency in the host proteome.
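As a rough illustration of the rarity idea in the abstract, the sketch below counts overlapping pentapeptides in a small made-up set of sequences and assigns each one a Shannon surprisal, -log2(relative frequency). This is an assumption for illustration only: the abstract does not state how information content was quantified, and the sequences, the function names `kmer_counts` and `surprisal_bits`, and the use of surprisal as a score are all hypothetical.

```python
import math
from collections import Counter


def kmer_counts(sequences, k=5):
    """Count every overlapping k-residue window across all sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def surprisal_bits(counts):
    """Map each k-mer to -log2(relative frequency); rarer k-mers score higher.

    Assumption: surprisal is used here as a stand-in for "information content";
    it is not taken from the paper.
    """
    total = sum(counts.values())
    return {kmer: -math.log2(n / total) for kmer, n in counts.items()}


# Hypothetical toy "proteome"; not real protein data.
toy_proteome = ["MKTAYIAKQR", "MKTAYLLKQR", "GGMKTAYWWW"]

counts = kmer_counts(toy_proteome)
scores = surprisal_bits(counts)

# List pentapeptides from rarest (highest surprisal) to most common.
for kmer, bits in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{kmer}  count={counts[kmer]}  surprisal={bits:.2f} bits")
```

In this toy set, the pentapeptide shared by all three sequences receives the lowest score and the ones seen only once receive the highest, which mirrors the inverse relation between frequency and information that the abstract describes. A real analysis would count against a full host proteome rather than three short strings.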