Thioulouse J, Lobry J R
Laboratoire de Biométrie, Génétique et Biologie des Populations URA CNRS 243 Université Lyon, Villeurbanne, France.
Comput Appl Biosci. 1995 Jun;11(3):321-9. doi: 10.1093/bioinformatics/11.3.321.
A multivariate analysis method called co-inertia analysis was used to determine the main relationships between two data tables having identical rows. This method is available in the ADE multivariate analysis package for Macintosh micro-computers. It was applied to two data sets, one containing the amino-acid composition of 999 E. coli proteins, and the other the values of 402 physico-chemical properties for the 20 natural amino-acids. There were strong relationships between amino-acid physico-chemical properties and the composition of proteins. The first common factor was hydrophobicity; it is linked to the biological environment of proteins, either in the cytoplasm (or outside the cell), or in the nonpolar environment of the phospholipid bilayer of biological membranes. The second factor linked the expressivity of protein genes and the propensity of amino-acids to form alpha helix/beta sheets. The third factor showed that heavy, aromatic amino-acids tend to be avoided, except when they are needed for structural or functional reasons. These results are discussed in terms of selective pressure acting on amino-acid composition of proteins.
一种名为共惯性分析的多元分析方法被用于确定具有相同行的两个数据表之间的主要关系。该方法可在适用于Macintosh微型计算机的ADE多元分析软件包中找到。它被应用于两个数据集,一个包含999种大肠杆菌蛋白质的氨基酸组成,另一个包含20种天然氨基酸的402种物理化学性质的值。氨基酸的物理化学性质与蛋白质组成之间存在很强的关系。第一个共同因素是疏水性;它与蛋白质的生物环境有关,无论是在细胞质(或细胞外)中,还是在生物膜磷脂双层的非极性环境中。第二个因素将蛋白质基因的表达能力与氨基酸形成α螺旋/β折叠的倾向联系起来。第三个因素表明,除了出于结构或功能原因需要时,重的、芳香族氨基酸往往会被避免。这些结果从作用于蛋白质氨基酸组成的选择压力方面进行了讨论。