Suppr超能文献

基于广义拟氨基酸组成的 DNA 序列系统发育分析。

Phylogenetic analysis of DNA sequences based on the generalized pseudo-amino acid composition.

机构信息

School of Mathematical Sciences, Dalian University of Technology, Dalian, Liaoning 116024, PR China.

出版信息

J Theor Biol. 2011 Jan 21;269(1):217-23. doi: 10.1016/j.jtbi.2010.10.027. Epub 2010 Oct 30.

Abstract

The main work of this paper is to propose a new theory and method, which is based on the idea of the pseudo-amino acid composition, for phylogenetic analysis of DNA primary sequences. In our method, we revise the part of the occurrence frequency of 20 amino acids in the method of the pseudo-amino acid composition by replacing the frequency of 16 dinucleotides. And we select eight LZ complexity factors of eight (0,1) sequences of a DNA primary sequence as PseAA components. Finally, we characterize a DNA sequence with a 24-dimensional vector. We reconstruct the phylogenetic trees of two datasets. The results show that our method is efficient and significant.

摘要

本文的主要工作是提出一种新的理论和方法,该方法基于伪氨基酸组成的思想,用于 DNA 一级序列的系统发育分析。在我们的方法中,通过替换 16 个二核苷酸的频率,对伪氨基酸组成方法中 20 种氨基酸出现频率的部分进行了修正。并选择 DNA 一级序列的八个(0,1)序列的八个 LZ 复杂度因子作为 PseAA 成分。最后,用一个 24 维向量来描述一个 DNA 序列。我们重建了两个数据集的系统发育树。结果表明,我们的方法是有效和显著的。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验