Corona-Ruiz M, Hernandez-Cabrera Francisco, Cantú-González José Roberto, González-Amezcua O, Javier Almaguer Francisco
Facultad de Ciencias Físico-Matemáticas, Universidad Autónoma de Nuevo León, San Nicolás de los Garza, Mexico.
Escuela de Sistemas PMRV, Unidad Acuña, Universidad Autónoma de Coahuila, Saltillo, Mexico.
Front Genet. 2019 Mar 8;10:66. doi: 10.3389/fgene.2019.00066. eCollection 2019.
This paper presents an exploratory analysis of the mitochondrial DNA (mtDNA) of 32 species in the subphylum Vertebrata, divided in 7 taxonomic classes. Multiple stochastic parameters, such as the Hurst and detrended fluctuation analysis (DFA) exponents, Shannon entropy, and Chargaff ratio are computed for each DNA sequence. The biological interpretation of these parameters leads to defining a triplet of novel indices. These new functions incorporate the long-range correlations, the probability of occurrence of nucleic bases, and the ratio of pyrimidines-to-purines. Results suggest that relevant regions in mtDNA can be located using the proposed indices. Furthermore, early results from clustering algorithms indicate that the indices introduced might be useful in phylogenetic studies.
本文对脊椎动物亚门中7个分类纲的32个物种的线粒体DNA(mtDNA)进行了探索性分析。针对每个DNA序列计算了多个随机参数,如赫斯特指数和去趋势波动分析(DFA)指数、香农熵和查加夫比率。对这些参数的生物学解释导致定义了一组新的指标。这些新函数纳入了长程相关性、核酸碱基出现的概率以及嘧啶与嘌呤的比率。结果表明,使用所提出的指标可以定位mtDNA中的相关区域。此外,聚类算法的早期结果表明,所引入的指标可能在系统发育研究中有用。