• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于重复k字分布的各种基因组序列分类

Classification of various genomic sequences based on distribution of repeated k-word.

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:3894-3897. doi: 10.1109/EMBC.2017.8037707.

DOI:10.1109/EMBC.2017.8037707
PMID:29060748
Abstract

In order to extract phylogenetic information from DNA sequences, alignment-free methods and alignment-based methods are used. Alignment-based methods have high complexity and conventional alignment-free methods have low accuracy. In this paper, a new alignment-free method based on the distribution of repeated k-word measure is proposed. This novel measure is based on k-words and its multiple repeated words. We can get higher performance than conventional word count methods in case of using proposed scheme while maintaining total time complexity. The proposed measure shows better performance compared to conventional alignment-free methods with respect to RF distance.

摘要

为了从DNA序列中提取系统发育信息,人们使用了无比对方法和基于比对的方法。基于比对的方法具有高复杂性,而传统的无比对方法准确性较低。本文提出了一种基于重复k字度量分布的新的无比对方法。这种新颖的度量基于k字及其多个重复词。在保持总时间复杂度的情况下,使用所提出的方案时,我们可以获得比传统词计数方法更高的性能。在所提出的度量与传统无比对方法相比,在RF距离方面表现出更好的性能。

相似文献

1
Classification of various genomic sequences based on distribution of repeated k-word.基于重复k字分布的各种基因组序列分类
Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:3894-3897. doi: 10.1109/EMBC.2017.8037707.
2
Genome classification improvements based on k-mer intervals in sequences.基于序列中 k-mer 间隔的基因组分类改进。
Genomics. 2019 Dec;111(6):1574-1582. doi: 10.1016/j.ygeno.2018.11.001. Epub 2018 Nov 13.
3
A new alignment free genome comparison algorithm based on statistically estimated feature frequency profile.一种基于统计估计特征频率分布的新型无比对基因组比较算法。
Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:4265-4268. doi: 10.1109/EMBC.2017.8037798.
4
Extraction of high quality k-words for alignment-free sequence comparison.用于无比对序列比较的高质量k词提取。
J Theor Biol. 2014 Oct 7;358:31-51. doi: 10.1016/j.jtbi.2014.05.016. Epub 2014 May 17.
5
Fast alignment-free sequence comparison using spaced-word frequencies.基于空位词频的快速无比对序列比较。
Bioinformatics. 2014 Jul 15;30(14):1991-9. doi: 10.1093/bioinformatics/btu177. Epub 2014 Apr 3.
6
CD-MAWS: An Alignment-Free Phylogeny Estimation Method Using Cosine Distance on Minimal Absent Word Sets.CD-MAWS:一种基于最小缺失词集余弦距离的无比对系统发育估计方法。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):196-205. doi: 10.1109/TCBB.2021.3136792. Epub 2023 Feb 3.
7
Fast and accurate phylogeny reconstruction using filtered spaced-word matches.使用过滤后的间隔词匹配进行快速准确的系统发育重建。
Bioinformatics. 2017 Apr 1;33(7):971-979. doi: 10.1093/bioinformatics/btw776.
8
Sequence Comparison Without Alignment: The SpaM Approaches.无需比对的序列比较:SpaM方法
Methods Mol Biol. 2021;2231:121-134. doi: 10.1007/978-1-0716-1036-7_8.
9
Measurement of word frequencies in genomic DNA sequences based on partial alignment and fuzzy set.
J Bioinform Comput Biol. 2014 Aug;12(4):1450019. doi: 10.1142/S021972001450019X. Epub 2014 Aug 1.
10
Alignment-free sequence comparison using joint frequency and position information of k-words.使用k字联合同现频率和位置信息的无比对序列比较
Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:3880-3883. doi: 10.1109/EMBC.2017.8037704.

引用本文的文献

1
Compression-Complexity Measures for Analysis and Classification of Coronaviruses.用于冠状病毒分析和分类的压缩-复杂度度量
Entropy (Basel). 2022 Dec 31;25(1):81. doi: 10.3390/e25010081.