Suppr超能文献

分段 K-mer 及其在线粒体基因组序列相似性分析中的应用。

Segmented K-mer and its application on similarity analysis of mitochondrial genome sequences.

机构信息

Department of Mathematics, School of Science, Anhui Science and Technology University, Fengyang, Anhui 233100, China.

出版信息

Gene. 2013 Apr 15;518(2):419-24. doi: 10.1016/j.gene.2012.12.079. Epub 2013 Jan 23.

Abstract

K-mer-based approach has been widely used in similarity analyses so as to discover similarity/dissimilarity among different biological sequences. In this study, we have improved the traditional K-mer method, and introduce a segmented K-mer approach (s-K-mer). After each primary sequence is divided into several segments, we simultaneously transform all these segments into corresponding K-mer-based vectors. In this approach, it is vital how to determine the optimal combination of distance metric with the number of K and the number of segments, i.e., (K(⁎), s(⁎), and d(⁎)). Based on the cascaded feature vectors transformed from s(⁎) segmented sequences, we analyze 34 mammalian genome sequences using the proposed s-K-mer approach. Meanwhile, we compare the results of s-K-mer with those of traditional K-mer. The contrastive analysis results demonstrate that s-K-mer approach outperforms the traditionally K-mer method on similarity analysis among different species.

摘要

基于 K -mer 的方法已被广泛应用于相似性分析,以发现不同生物序列之间的相似性/差异性。在本研究中,我们改进了传统的 K-mer 方法,并引入了分段 K-mer 方法(s-K-mer)。在将每个主要序列划分为几个片段后,我们同时将所有这些片段转换为相应的基于 K-mer 的向量。在这种方法中,如何确定距离度量与 K 的数量和片段的数量的最佳组合(K(⁎)、s(⁎)和 d(⁎))至关重要。基于从 s(⁎)分段序列转换的级联特征向量,我们使用提出的 s-K-mer 方法分析了 34 种哺乳动物基因组序列。同时,我们将 s-K-mer 的结果与传统 K-mer 的结果进行了比较。对比分析结果表明,s-K-mer 方法在不同物种之间的相似性分析方面优于传统的 K-mer 方法。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验