Buldyrev S V, Goldberger A L, Havlin S, Peng C K, Stanley H E, Stanley M H, Simons M
Center for Polymer Studies, Boston University, Massachusetts 02215.
Biophys J. 1993 Dec;65(6):2673-9. doi: 10.1016/S0006-3495(93)81290-6.
Mapping nucleotide sequences onto a "DNA walk" produces a novel representation of DNA that can then be studied quantitatively using techniques derived from fractal landscape analysis. We used this method to analyze 11 complete genomic and cDNA myosin heavy chain (MHC) sequences from 8 different species. Our analysis suggests that the fractal complexity of MHC genes increases with evolution, in the order vertebrate > invertebrate > yeast. The increase in complexity is measured by the presence of long-range power-law correlations, which are quantified by the scaling exponent alpha. We develop a simple iterative model, based on known properties of polymeric sequences, that generates long-range nucleotide correlations from an initially noncorrelated coding region. Both this new model and the DNA walk analysis support the intron-late theory of gene evolution.
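The abstract does not spell out the walk rule or the fitting procedure, so the following is only a minimal sketch under stated assumptions. It assumes the purine-pyrimidine rule (step +1 for C or T, step -1 for A or G) and estimates alpha as the slope of log F(l) against log l, where F(l) is the root-mean-square fluctuation of the walk displacement over windows of length l. The function names (`dna_walk`, `rms_fluctuation`, `scaling_exponent`) and the window lengths are illustrative choices, not taken from the paper.

```python
import math
import random


def dna_walk(seq):
    """Return the cumulative walk y(0..n) for a nucleotide string.

    Assumed rule (not stated in the abstract): pyrimidines (C, T) step +1,
    purines (A, G) step -1. Any other symbol is skipped.
    """
    y = [0]
    for base in seq.upper():
        if base in "CT":
            y.append(y[-1] + 1)
        elif base in "AG":
            y.append(y[-1] - 1)
    return y


def rms_fluctuation(y, l):
    """F(l): standard deviation of y(i + l) - y(i) over all start positions i."""
    deltas = [y[i + l] - y[i] for i in range(len(y) - l)]
    mean = sum(deltas) / len(deltas)
    variance = sum((d - mean) ** 2 for d in deltas) / len(deltas)
    return math.sqrt(variance)


def scaling_exponent(y, lengths):
    """Least-squares slope of log F(l) versus log l, i.e. an estimate of alpha."""
    xs = [math.log(l) for l in lengths]
    fs = [math.log(rms_fluctuation(y, l)) for l in lengths]
    mx = sum(xs) / len(xs)
    mf = sum(fs) / len(fs)
    num = sum((x - mx) * (f - mf) for x, f in zip(xs, fs))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den


if __name__ == "__main__":
    # Synthetic uncorrelated sequence; real input would be an MHC gene sequence.
    random.seed(0)
    seq = "".join(random.choice("ACGT") for _ in range(20000))
    alpha = scaling_exponent(dna_walk(seq), [4, 8, 16, 32, 64])
    print(f"estimated alpha = {alpha:.3f}")
```

For an uncorrelated sequence this estimate should sit near 0.5, the value for an ordinary random walk; long-range power-law correlations of the kind described above would show up as an exponent that departs from 0.5 over a wide range of l.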