• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于全局和局部位置信息的 DNA 序列的一种新表示。

One novel representation of DNA sequence based on the global and local position information.

机构信息

School of Information and Electronic Engineering, Wuzhou University, Wuzhu, China.

College of Computer Science and Electronic Engineering, Hunan University, Hunan, China.

出版信息

Sci Rep. 2018 May 15;8(1):7592. doi: 10.1038/s41598-018-26005-3.

DOI:10.1038/s41598-018-26005-3
PMID:29765099
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5953932/
Abstract

One novel representation of DNA sequence combining the global and local position information of the original sequence has been proposed to distinguish the different species. First, for the sufficient exploitation of global information, one graphical representation of DNA sequence has been formulated according to the curve of Fermat spiral. Then, for the consideration of local characteristics of DNA sequence, attaching each point in the curve of Fermat spiral with the related mass has been applied based on the relationships of neighboring four nucleotides. In this paper, the normalized moments of inertia of the curve of Fermat spiral which composed by the points with mass has been calculated as the numerical description of the corresponding DNA sequence on the first exons of beta-global genes. Choosing the Euclidean distance as the measurement of the numerical descriptions, the similarity between species has shown the performance of proposed method.

摘要

已经提出了一种将 DNA 序列的全局和局部位置信息结合在一起的新表示方法,以区分不同的物种。首先,为了充分利用全局信息,根据费马螺线的曲线,制定了一种 DNA 序列的图形表示方法。然后,为了考虑 DNA 序列的局部特征,根据相邻四个核苷酸的关系,将费马螺线曲线上的每个点与相关质量附加在一起。在本文中,计算了由带质量的点组成的费马螺线的归一化惯性矩,作为β-全局基因第一个外显子上相应 DNA 序列的数值描述。选择欧几里得距离作为数值描述的度量,物种间的相似性显示了所提出方法的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ea/5953932/48b411386bf8/41598_2018_26005_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ea/5953932/f08fd1dbc104/41598_2018_26005_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ea/5953932/47d4483acc78/41598_2018_26005_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ea/5953932/48b411386bf8/41598_2018_26005_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ea/5953932/f08fd1dbc104/41598_2018_26005_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ea/5953932/47d4483acc78/41598_2018_26005_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ea/5953932/48b411386bf8/41598_2018_26005_Fig3_HTML.jpg

相似文献

1
One novel representation of DNA sequence based on the global and local position information.基于全局和局部位置信息的 DNA 序列的一种新表示。
Sci Rep. 2018 May 15;8(1):7592. doi: 10.1038/s41598-018-26005-3.
2
Numerical characterization of DNA sequences based on digital signal method.基于数字信号方法的DNA序列数值表征
Comput Biol Med. 2009 Apr;39(4):388-91. doi: 10.1016/j.compbiomed.2009.01.009. Epub 2009 Mar 3.
3
TN curve: a novel 3D graphical representation of DNA sequence based on trinucleotides and its applications.TN 曲线:一种基于三核苷酸的新型 DNA 序列三维图形表示及其应用。
J Theor Biol. 2009 Dec 7;261(3):459-68. doi: 10.1016/j.jtbi.2009.08.005. Epub 2009 Aug 11.
4
Non-standard similarity/dissimilarity analysis of DNA sequences.DNA序列的非标准相似性/相异性分析。
Genomics. 2014 Dec;104(6 Pt B):464-71. doi: 10.1016/j.ygeno.2014.08.010. Epub 2014 Aug 28.
5
FermatS: A Novel Numerical Representation for Protein Sequence Comparison and DNA-binding Protein Identification.费马斯特:一种用于蛋白质序列比较和DNA结合蛋白识别的新型数字表示法。
Comb Chem High Throughput Screen. 2021;24(10):1746-1753. doi: 10.2174/1386207323999201117111738.
6
Similarity studies of DNA sequences based on a new 2D graphical representation.基于一种新的二维图形表示法的DNA序列相似性研究。
Biophys Chem. 2009 Jul;143(1-2):55-9. doi: 10.1016/j.bpc.2009.03.013. Epub 2009 Apr 8.
7
Graphical Representation and Similarity Analysis of DNA Sequences Based on Trigonometric Functions.基于三角函数的DNA序列图形表示与相似性分析
Acta Biotheor. 2018 Jun;66(2):113-133. doi: 10.1007/s10441-018-9324-0. Epub 2018 Apr 19.
8
PNN-curve: a new 2D graphical representation of DNA sequences and its application.PNN曲线:一种DNA序列的新型二维图形表示及其应用
J Theor Biol. 2006 Dec 21;243(4):555-61. doi: 10.1016/j.jtbi.2006.07.018. Epub 2006 Jul 24.
9
Numerical characterization of DNA sequence based on dinucleotides.基于二核苷酸的DNA序列数值表征
ScientificWorldJournal. 2012;2012:104269. doi: 10.1100/2012/104269. Epub 2012 Apr 24.
10
Graphical representation for DNA sequences via joint diagonalization of matrix pencil.通过矩阵束的联合对角化对 DNA 序列进行图形表示。
IEEE J Biomed Health Inform. 2013 May;17(3):503-11. doi: 10.1109/titb.2012.2227146.

引用本文的文献

1
Bioinformatics tools for the sequence complexity estimates.用于序列复杂性估计的生物信息学工具。
Biophys Rev. 2023 Sep 15;15(5):1367-1378. doi: 10.1007/s12551-023-01140-y. eCollection 2023 Oct.
2
DCiPatho: deep cross-fusion networks for genome scale identification of pathogens.DCiPatho:用于大规模病原体基因组识别的深度交叉融合网络。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad194.
3
Detection of intra-family coronavirus genome sequences through graphical representation and artificial neural network.通过图形表示和人工神经网络检测家庭内部冠状病毒基因组序列

本文引用的文献

1
Coronavirus phylogeny based on triplets of nucleic acids bases.基于核酸碱基三联体的冠状病毒系统发育
Chem Phys Lett. 2006 Apr 15;421(4):313-318. doi: 10.1016/j.cplett.2006.01.030. Epub 2006 Feb 20.
2
Spectral-dynamic representation of DNA sequences.DNA序列的光谱动力学表示
J Biomed Inform. 2017 Aug;72:1-7. doi: 10.1016/j.jbi.2017.06.001. Epub 2017 Jun 3.
3
Circular Helix-Like Curve: An Effective Tool of Biological Sequence Analysis and Comparison.环形螺旋状曲线:生物序列分析与比较的有效工具
Expert Syst Appl. 2022 May 15;194:116559. doi: 10.1016/j.eswa.2022.116559. Epub 2022 Jan 21.
4
Non-standard bioinformatics characterization of SARS-CoV-2.非标准生物信息学 SARS-CoV-2 特征分析。
Comput Biol Med. 2021 Apr;131:104247. doi: 10.1016/j.compbiomed.2021.104247. Epub 2021 Feb 1.
5
Control of Macromolecule Chains Structure in a Nanofiber.纳米纤维中大分子链结构的控制
Polymers (Basel). 2020 Oct 8;12(10):2305. doi: 10.3390/polym12102305.
6
A new graph-theoretic approach to determine the similarity of genome sequences based on nucleotide triplets.一种新的基于三核苷酸的图论方法来确定基因组序列的相似性。
Genomics. 2020 Nov;112(6):4701-4714. doi: 10.1016/j.ygeno.2020.08.023. Epub 2020 Aug 19.
Comput Math Methods Med. 2016;2016:3262813. doi: 10.1155/2016/3262813. Epub 2016 Jun 14.
4
Graphical Representation and Similarity Analysis of Protein Sequences Based on Fractal Interpolation.基于分形插值的蛋白质序列图形表示与相似性分析
IEEE/ACM Trans Comput Biol Bioinform. 2017 Jan-Feb;14(1):182-192. doi: 10.1109/TCBB.2015.2511731. Epub 2015 Dec 29.
5
20D-dynamic representation of protein sequences.蛋白质序列的20D动态表示
Genomics. 2016 Jan;107(1):16-23. doi: 10.1016/j.ygeno.2015.12.003. Epub 2015 Dec 17.
6
Similarity evaluation of DNA sequences based on frequent patterns and entropy.基于频繁模式和熵的DNA序列相似性评估
BMC Genomics. 2015;16 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2164-16-S3-S5. Epub 2015 Jan 29.
7
An efficient numerical method for protein sequences similarity analysis based on a new two-dimensional graphical representation.一种基于新的二维图形表示的蛋白质序列相似性分析高效数值方法。
SAR QSAR Environ Res. 2015;26(2):125-37. doi: 10.1080/1062936X.2014.995700.
8
Representation of DNA sequences in genetic codon context with applications in exon and intron prediction.遗传密码子背景下DNA序列的表示及其在外显子和内含子预测中的应用。
J Bioinform Comput Biol. 2015 Apr;13(2):1550004. doi: 10.1142/S0219720015500043. Epub 2014 Dec 10.
9
A novel method for comparative analysis of DNA sequences by Ramanujan-Fourier transform.一种通过拉马努金-傅里叶变换对DNA序列进行比较分析的新方法。
J Comput Biol. 2014 Dec;21(12):867-79. doi: 10.1089/cmb.2014.0120.
10
Primary structure similarity analysis of proteins sequences by a new graphical representation.基于一种新的图形表示法的蛋白质序列一级结构相似性分析
SAR QSAR Environ Res. 2014;25(10):791-803. doi: 10.1080/1062936X.2014.955055. Epub 2014 Sep 22.