In search of coding and non-coding regions of DNA sequences based on balanced estimation of diffusion entropy.

Author information

Zhang Jin, Zhang Wenqing, Yang Huijie

Affiliations

Business School, University of Shanghai for Science and Technology, Shanghai, 200093, China.

School of Information Science and Engineering, University of Jinan, Jinan, 250022, China.

Publication information

J Biol Phys. 2016 Jan;42(1):99-106. doi: 10.1007/s10867-015-9399-7. Epub 2015 Aug 29.

Abstract

Identification of coding regions in DNA sequences remains challenging. Various methods have been proposed, but these are limited by species-dependence and the need for adequate training sets. The elements in DNA coding regions are known to be distributed in a quasi-random way, while those in non-coding regions have typical similar structures. For short sequences, these statistical characteristics cannot be extracted correctly and cannot even be detected. This paper introduces a new way to solve the problem: balanced estimation of diffusion entropy (BEDE).
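As a rough illustration of the kind of quantity the abstract refers to, the sketch below computes a plain diffusion entropy for a DNA string and fits its scaling exponent. Several things here are assumptions and are not taken from the paper: the purine/pyrimidine mapping to +1/-1 steps, the function names, the window lengths, and above all the naive histogram entropy estimator. The balanced estimator that gives BEDE its name is not reproduced here, so this should be read as a sketch of ordinary diffusion entropy analysis rather than the authors' method.

```python
import math
import random
from collections import Counter


def dna_to_steps(seq):
    """Map purines (A, G) to +1 and pyrimidines (C, T) to -1.

    This encoding is an assumption made for illustration only.
    """
    mapping = {"A": 1, "G": 1, "C": -1, "T": -1}
    return [mapping[base] for base in seq.upper()]


def diffusion_entropy(steps, window):
    """Naive Shannon entropy (in nats) of displacements over overlapping windows."""
    # Prefix sums let each window displacement be read off in O(1).
    prefix = [0]
    for step in steps:
        prefix.append(prefix[-1] + step)
    displacements = [
        prefix[i + window] - prefix[i] for i in range(len(steps) - window + 1)
    ]
    total = len(displacements)
    counts = Counter(displacements)
    # Plug-in (frequency) estimate; NOT the balanced estimator of the paper.
    return -sum((c / total) * math.log(c / total) for c in counts.values())


def scaling_exponent(steps, windows):
    """Least-squares slope of the entropy S(t) against ln t."""
    xs = [math.log(w) for w in windows]
    ys = [diffusion_entropy(steps, w) for w in windows]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    return cov / var


if __name__ == "__main__":
    random.seed(0)
    # Synthetic uncorrelated sequence, used only to exercise the functions.
    seq = "".join(random.choice("ACGT") for _ in range(20000))
    delta = scaling_exponent(dna_to_steps(seq), [4, 8, 16, 32, 64])
    print(f"estimated scaling exponent: {delta:.3f}")
```

For an uncorrelated random sequence the fitted exponent is expected to lie near 0.5. With short sequences the naive estimator used here becomes biased as the windows grow, which is the regime the abstract says BEDE is meant to address.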
