Suppr超能文献

DNA序列的图形和数值表示:相似性的统计学方面

Graphical and numerical representations of DNA sequences: statistical aspects of similarity.

作者信息

Bielińska-Wąż Dorota

机构信息

Instytut Fizyki, Uniwersytet Mikołaja Kopernika, Grudziądzka 5, 87-100 Toruń, Poland.

出版信息

J Math Chem. 2011;49(10):2345. doi: 10.1007/s10910-011-9890-8. Epub 2011 Aug 28.

Abstract

New approaches aiming at a detailed similarity/dissimilarity analysis of DNA sequences are formulated. Several corrections that enrich the information which may be derived from the alignment methods are proposed. The corrections take into account the distributions along the sequences of the aligned bases (neglected in the standard alignment methods). As a consequence, different aspects of similarity, as for example asymmetry of the gene structure, may be studied either using new similarity measures associated with four-component spectral representation of the DNA sequences or using alignment methods with corrections introduced in this paper. The corrections to the alignment methods and the statistical distribution moment-based descriptors derived from the four-component spectral representation of the DNA sequences are applied to similarity/dissimilarity studies of -globin gene across species. The studies are supplemented by detailed similarity studies for histones H1 and H4 coding sequences. The data are described according to the latest version of the EMBL database. The work is supplemented by a concise review of the state-of-art graphical representations of DNA sequences.

摘要

提出了旨在对DNA序列进行详细的相似性/差异性分析的新方法。提出了几种校正方法,这些校正方法丰富了可从比对方法中获得的信息。这些校正考虑了比对碱基沿序列的分布(这在标准比对方法中被忽略)。因此,可以使用与DNA序列的四分量谱表示相关的新相似性度量,或者使用本文引入校正的比对方法,来研究相似性的不同方面,例如基因结构的不对称性。比对方法的校正以及从DNA序列的四分量谱表示导出的基于统计分布矩的描述符,被应用于跨物种的β-珠蛋白基因的相似性/差异性研究。这些研究通过对组蛋白H1和H4编码序列的详细相似性研究得到补充。数据根据EMBL数据库的最新版本进行描述。这项工作还辅以对DNA序列最新图形表示的简要综述。

相似文献

2
Non-standard similarity/dissimilarity analysis of DNA sequences.DNA序列的非标准相似性/相异性分析。
Genomics. 2014 Dec;104(6 Pt B):464-71. doi: 10.1016/j.ygeno.2014.08.010. Epub 2014 Aug 28.
4
Spectral-dynamic representation of DNA sequences.DNA序列的光谱动力学表示
J Biomed Inform. 2017 Aug;72:1-7. doi: 10.1016/j.jbi.2017.06.001. Epub 2017 Jun 3.
6
Classification studies based on a spectral representation of DNA.基于 DNA 光谱表示的分类研究。
J Theor Biol. 2010 Oct 21;266(4):667-74. doi: 10.1016/j.jtbi.2010.07.038. Epub 2010 Aug 4.
7
On the similarity of DNA primary sequences.关于DNA一级序列的相似性。
J Chem Inf Comput Sci. 2000 May-Jun;40(3):599-606. doi: 10.1021/ci9901082.
9
Graphical Representation and Similarity Analysis of Protein Sequences Based on Fractal Interpolation.基于分形插值的蛋白质序列图形表示与相似性分析
IEEE/ACM Trans Comput Biol Bioinform. 2017 Jan-Feb;14(1):182-192. doi: 10.1109/TCBB.2015.2511731. Epub 2015 Dec 29.

引用本文的文献

2
Non-standard bioinformatics characterization of SARS-CoV-2.非标准生物信息学 SARS-CoV-2 特征分析。
Comput Biol Med. 2021 Apr;131:104247. doi: 10.1016/j.compbiomed.2021.104247. Epub 2021 Feb 1.
5
Novel Method of 3-Dimensional Graphical Representation for Proteins and Its Application.蛋白质三维图形表示的新方法及其应用
Evol Bioinform Online. 2018 Jun 12;14:1176934318777755. doi: 10.1177/1176934318777755. eCollection 2018.

本文引用的文献

1
Coronavirus phylogeny based on triplets of nucleic acids bases.基于核酸碱基三联体的冠状病毒系统发育
Chem Phys Lett. 2006 Apr 15;421(4):313-318. doi: 10.1016/j.cplett.2006.01.030. Epub 2006 Feb 20.
2
Classification studies based on a spectral representation of DNA.基于 DNA 光谱表示的分类研究。
J Theor Biol. 2010 Oct 21;266(4):667-74. doi: 10.1016/j.jtbi.2010.07.038. Epub 2010 Aug 4.
10
Genome analysis with inter-nucleotide distances.基于核苷酸间距离的基因组分析。
Bioinformatics. 2009 Dec 1;25(23):3064-70. doi: 10.1093/bioinformatics/btp546. Epub 2009 Sep 16.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验