Max Planck Institute for Evolutionary Biology, August-Thienemann Strasse 2, 24306 Plön, Germany.
Genes (Basel). 2021 Nov 28;12(12):1913. doi: 10.3390/genes12121913.
We study the potential for the evolution of genes from random nucleotide sequences using libraries of expressing random sequence peptides. We assess the effects of such peptides on cell growth by monitoring frequency changes in individual clones in a complex library through four serial passages. Using a new analysis pipeline that allows the tracing of peptides of all lengths, we find that over half of the peptides have consistent effects on cell growth. Across nine different experiments, around 16% of clones increase in frequency and 36% decrease, with some variation between individual experiments. Shorter peptides (8-20 residues), are more likely to increase in frequency, longer ones are more likely to decrease. GC content, amino acid composition, intrinsic disorder, and aggregation propensity show slightly different patterns between peptide groups. Sequences that increase in frequency tend to be more disordered with lower aggregation propensity. This coincides with the observation that young genes with more disordered structures are better tolerated in genomes. Our data indicate that random sequences can be a source of evolutionary innovation, since a large fraction of them are well tolerated by the cells or can provide a growth advantage.
我们使用表达随机序列肽的文库来研究随机核苷酸序列中基因进化的可能性。我们通过在四个连续传代中监测复杂文库中单个克隆的频率变化,来评估这些肽对细胞生长的影响。使用一种新的分析管道,我们可以追踪所有长度的肽,发现超过一半的肽对细胞生长有一致的影响。在九个不同的实验中,大约有 16%的克隆频率增加,36%的克隆频率减少,不同实验之间存在一些差异。较短的肽(8-20 个残基)更有可能增加频率,较长的肽更有可能减少。GC 含量、氨基酸组成、内在无序性和聚集倾向在肽组之间呈现出略有不同的模式。频率增加的序列往往更无序,聚集倾向更低。这与观察到的年轻基因具有更无序的结构在基因组中更能被容忍的结果一致。我们的数据表明,随机序列可以成为进化创新的来源,因为它们中的很大一部分被细胞很好地耐受,或者可以提供生长优势。