• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分析基因组序列中的相似性。

Analyzing similarities in genome sequences.

作者信息

Fonseca I C, Nogueira E, Figueirêdo P H, Coutinho S

机构信息

Departamento de Física, Universidade Federal da Paraíba, 58051-970, João Pessoa, PB, Brazil.

Departamento de Física, Universidade Federal Rural de Pernambuco, 52171-900, Recife, PE, Brazil.

出版信息

Eur Phys J E Soft Matter. 2018 Jan 19;41(1):8. doi: 10.1140/epje/i2018-11609-8.

DOI:10.1140/epje/i2018-11609-8
PMID:29349724
Abstract

This article investigates aspects of similarity between complete sequences of mitochondrial DNA by determining the distribution of the relative frequencies of words with different lengths and the characteristics of their relevance throughout the sequences. The degree of similarity is obtained by comparing the distances between words contained within these sequences. Our results indicate that the best groupings among different species depend on the lengths of words and their respective relative frequencies. We also observed that the longer the word the more consistent the grouping between the sequences becomes. The application of our results, together with the perspective of analyzing DNA sequences belonging to a single biological species, may be important for the construction of phylogenetic trees, which are appropriate structures for understanding the evolutionary history of the species.

摘要

本文通过确定不同长度单词的相对频率分布及其在整个序列中的相关性特征,研究了线粒体DNA完整序列之间的相似性。相似程度是通过比较这些序列中所含单词之间的距离获得的。我们的结果表明,不同物种之间的最佳分组取决于单词的长度及其各自的相对频率。我们还观察到,单词越长,序列之间的分组就越一致。将我们的结果应用于分析属于单一生物物种的DNA序列,对于构建系统发育树可能很重要,系统发育树是理解物种进化历史的合适结构。

相似文献

1
Analyzing similarities in genome sequences.分析基因组序列中的相似性。
Eur Phys J E Soft Matter. 2018 Jan 19;41(1):8. doi: 10.1140/epje/i2018-11609-8.
2
Complete mitochondrial DNA sequence of oyster Crassostrea hongkongensis-a case of "Tandem duplication-random loss" for genome rearrangement in Crassostrea?香港牡蛎线粒体DNA全序列——牡蛎基因组重排中“串联重复-随机丢失”的一个案例?
BMC Genomics. 2008 Oct 11;9:477. doi: 10.1186/1471-2164-9-477.
3
Complete Mitochondrial Genome of Anoplocephala magna Solidifying the Species.巨裸头绦虫完整线粒体基因组巩固了该物种地位
Korean J Parasitol. 2016 Jun;54(3):369-73. doi: 10.3347/kjp.2016.54.3.369. Epub 2016 Jun 30.
4
Analysis of the complete mitochondrial genome of Pochonia chlamydosporia suggests a close relationship to the invertebrate-pathogenic fungi in Hypocreales.对厚垣孢普可尼亚菌完整线粒体基因组的分析表明,它与肉座菌目中的无脊椎动物致病真菌关系密切。
BMC Microbiol. 2015 Jan 31;15:5. doi: 10.1186/s12866-015-0341-8.
5
The complete mitochondrial genome of Anoplocephala perfoliata, the first representative for the family Anoplocephalidae.叶状裸头绦虫的完整线粒体基因组,是裸头绦虫科的首个代表物种。
Parasit Vectors. 2015 Oct 21;8:549. doi: 10.1186/s13071-015-1172-z.
6
An improved model for whole genome phylogenetic analysis by Fourier transform.一种通过傅里叶变换进行全基因组系统发育分析的改进模型。
J Theor Biol. 2015 Oct 7;382:99-110. doi: 10.1016/j.jtbi.2015.06.033. Epub 2015 Jul 4.
7
Complete mitochondrial genomes reveal phylogeny relationship and evolutionary history of the family Felidae.完整的线粒体基因组揭示了猫科动物家族的系统发育关系和进化历史。
Genet Mol Res. 2013 Sep 3;12(3):3256-62. doi: 10.4238/2013.September.3.1.
8
The 135 kbp mitochondrial genome of Agaricus bisporus is the largest known eukaryotic reservoir of group I introns and plasmid-related sequences.双孢蘑菇的 135 kbp 线粒体基因组是已知真核生物中最大的 I 组内含子和质粒相关序列库。
Fungal Genet Biol. 2013 Jun;55:85-91. doi: 10.1016/j.fgb.2013.01.009. Epub 2013 Feb 19.
9
Complete mitochondrial genome and phylogenetic relationship analyses of Amphioctopus aegina (Gray, 1849) (Cephalopoda: Octopodidae).艾氏耳乌贼(Gray,1849年)(头足纲:章鱼科)的线粒体全基因组及系统发育关系分析
Mitochondrial DNA A DNA Mapp Seq Anal. 2017 Jan;28(1):17-18. doi: 10.3109/19401736.2015.1106522. Epub 2015 Dec 18.
10
Structure and evolution of the Phasianidae mitochondrial DNA control region.雉科线粒体DNA控制区的结构与进化
Mitochondrial DNA A DNA Mapp Seq Anal. 2016;27(1):350-4. doi: 10.3109/19401736.2014.895987. Epub 2014 Mar 11.

本文引用的文献

1
Distinguishing Functional DNA Words; A Method for Measuring Clustering Levels.区分功能 DNA 词;一种测量聚类水平的方法。
Sci Rep. 2017 Jan 27;7:41543. doi: 10.1038/srep41543.
2
Probability distribution of intersymbol distances in random symbolic sequences: Applications to improving detection of keywords in texts and of amino acid clustering in proteins.符号序列中符号间距离的概率分布:在提高文本中关键词检测和蛋白质中氨基酸聚类方面的应用。
Phys Rev E. 2016 Nov;94(5-1):052302. doi: 10.1103/PhysRevE.94.052302. Epub 2016 Nov 4.
3
A Guaranteed Similarity Metric Learning Framework for Biological Sequence Comparison.
一种用于生物序列比较的保证相似性度量学习框架。
IEEE/ACM Trans Comput Biol Bioinform. 2016 Sep-Oct;13(5):868-877. doi: 10.1109/TCBB.2015.2495186. Epub 2015 Oct 26.
4
Clustering of DNA words and biological function: a proof of principle.DNA 单词聚类与生物功能:原理验证。
J Theor Biol. 2012 Mar 21;297:127-36. doi: 10.1016/j.jtbi.2011.12.024. Epub 2011 Dec 30.
5
Objective method for estimating asymptotic parameters, with an application to sequence alignment.估计渐近参数的客观方法及其在序列比对中的应用。
Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Sep;84(3 Pt 1):031914. doi: 10.1103/PhysRevE.84.031914. Epub 2011 Sep 13.
6
Statistical physics approach to categorize biologic signals: from heart rate dynamics to DNA sequences.用于对生物信号进行分类的统计物理学方法:从心率动态到DNA序列
Chaos. 2007 Mar;17(1):015115. doi: 10.1063/1.2716147.
7
Words in DNA sequences: some case studies based on their frequency statistics.DNA序列中的词汇:基于频率统计的一些案例研究
J Math Biol. 2003 Jun;46(6):479-503. doi: 10.1007/s00285-002-0185-3.
8
Multifractal analysis of DNA walks and trails.DNA游走与轨迹的多重分形分析
Phys Rev E Stat Nonlin Soft Matter Phys. 2002 Dec;66(6 Pt 1):061906. doi: 10.1103/PhysRevE.66.061906. Epub 2002 Dec 18.
9
Characterizing long-range correlations in DNA sequences from wavelet analysis.通过小波分析表征DNA序列中的长程相关性。
Phys Rev Lett. 1995 Apr 17;74(16):3293-3296. doi: 10.1103/PhysRevLett.74.3293.
10
Systematic analysis of coding and noncoding DNA sequences using methods of statistical linguistics.使用统计语言学方法对编码和非编码DNA序列进行系统分析。
Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics. 1995 Sep;52(3):2939-50. doi: 10.1103/physreve.52.2939.