• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PCV:一种用于寻找同源核苷酸序列的无比对方法及其在系统发育研究中的应用。

PCV: An Alignment Free Method for Finding Homologous Nucleotide Sequences and its Application in Phylogenetic Study.

作者信息

Kumar Rajnish, Mishra Bharat Kumar, Lahiri Tapobrata, Kumar Gautam, Kumar Nilesh, Gupta Rahul, Pal Manoj Kumar

机构信息

Department of Applied Science, Indian Institute of Information Technology - Allahabad, Allahabad, UP, 211012, India.

出版信息

Interdiscip Sci. 2017 Jun;9(2):173-183. doi: 10.1007/s12539-015-0136-5. Epub 2016 Jan 29.

DOI:10.1007/s12539-015-0136-5
PMID:26825665
Abstract

Online retrieval of the homologous nucleotide sequences through existing alignment techniques is a common practice against the given database of sequences. The salient point of these techniques is their dependence on local alignment techniques and scoring matrices the reliability of which is limited by computational complexity and accuracy. Toward this direction, this work offers a novel way for numerical representation of genes which can further help in dividing the data space into smaller partitions helping formation of a search tree. In this context, this paper introduces a 36-dimensional Periodicity Count Value (PCV) which is representative of a particular nucleotide sequence and created through adaptation from the concept of stochastic model of Kolekar et al. (American Institute of Physics 1298:307-312, 2010. doi: 10.1063/1.3516320 ). The PCV construct uses information on physicochemical properties of nucleotides and their positional distribution pattern within a gene. It is observed that PCV representation of gene reduces computational cost in the calculation of distances between a pair of genes while being consistent with the existing methods. The validity of PCV-based method was further tested through their use in molecular phylogeny constructs in comparison with that using existing sequence alignment methods.

摘要

通过现有的比对技术在线检索同源核苷酸序列是针对给定序列数据库的常见做法。这些技术的突出特点是依赖局部比对技术和评分矩阵,而其可靠性受到计算复杂性和准确性的限制。朝着这个方向,这项工作提供了一种基因数值表示的新方法,这可以进一步帮助将数据空间划分为更小的分区,有助于形成搜索树。在这种情况下,本文引入了一种36维的周期性计数值(PCV),它代表特定的核苷酸序列,是通过改编Kolekar等人(美国物理研究所1298:307 - 312,2010。doi: 10.1063/1.3516320)的随机模型概念创建的。PCV构建使用了核苷酸的物理化学性质及其在基因内的位置分布模式的信息。据观察,基因的PCV表示在计算一对基因之间的距离时降低了计算成本,同时与现有方法一致。通过将基于PCV的方法与使用现有序列比对方法的方法相比,在分子系统发育构建中的应用进一步测试了其有效性。

相似文献

1
PCV: An Alignment Free Method for Finding Homologous Nucleotide Sequences and its Application in Phylogenetic Study.PCV:一种用于寻找同源核苷酸序列的无比对方法及其在系统发育研究中的应用。
Interdiscip Sci. 2017 Jun;9(2):173-183. doi: 10.1007/s12539-015-0136-5. Epub 2016 Jan 29.
2
A configuration space of homologous proteins conserving mutual information and allowing a phylogeny inference based on pair-wise Z-score probabilities.同源蛋白质的一种构象空间,其保留互信息并允许基于成对Z分数概率进行系统发育推断。
BMC Bioinformatics. 2005 Mar 10;6:49. doi: 10.1186/1471-2105-6-49.
3
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.
4
ProtPCV: A Fixed Dimensional Numerical Representation of Protein Sequence to Significantly Reduce Sequence Search Time.ProtPCV:一种用于显著减少序列搜索时间的蛋白质序列固定维数值表示。
Interdiscip Sci. 2020 Sep;12(3):276-287. doi: 10.1007/s12539-020-00380-w. Epub 2020 Jun 10.
5
Representation in stochastic search for phylogenetic tree reconstruction.用于系统发育树重建的随机搜索中的表示法。
J Biomed Inform. 2006 Feb;39(1):43-50. doi: 10.1016/j.jbi.2005.11.001. Epub 2005 Nov 28.
6
Genome sequence comparison under a new form of tri-nucleotide representation based on bio-chemical properties of nucleotides.基于核苷酸生化性质的三核苷酸表示新形式下的基因组序列比较。
Gene. 2020 Mar 10;730:144257. doi: 10.1016/j.gene.2019.144257. Epub 2019 Nov 21.
7
Numerical Characterization of DNA Sequences for Alignment-free Sequence Comparison - A Review.基于无比对的 DNA 序列比对的 DNA 序列数值特征化:综述
Comb Chem High Throughput Screen. 2022;25(3):365-380. doi: 10.2174/1386207324666210811101437.
8
An improved model for whole genome phylogenetic analysis by Fourier transform.一种通过傅里叶变换进行全基因组系统发育分析的改进模型。
J Theor Biol. 2015 Oct 7;382:99-110. doi: 10.1016/j.jtbi.2015.06.033. Epub 2015 Jul 4.
9
An efficient algorithm for statistical multiple alignment on arbitrary phylogenetic trees.一种用于在任意系统发育树上进行统计多重比对的高效算法。
J Comput Biol. 2003;10(6):869-89. doi: 10.1089/106652703322756122.
10
A novel method for comparative analysis of DNA sequences by Ramanujan-Fourier transform.一种通过拉马努金-傅里叶变换对DNA序列进行比较分析的新方法。
J Comput Biol. 2014 Dec;21(12):867-79. doi: 10.1089/cmb.2014.0120.

引用本文的文献

1
CaREM1.4 interacts with CaRIN4 to regulate tolerance by triggering cell death in pepper.CaREM1.4与CaRIN4相互作用,通过引发辣椒细胞死亡来调节耐受性。
Hortic Res. 2023 Mar 28;10(5):uhad053. doi: 10.1093/hr/uhad053. eCollection 2023 May.
2
A Sarcina bacterium linked to lethal disease in sanctuary chimpanzees in Sierra Leone.与塞拉利昂保护区内致死性疾病相关的一种沙雷氏菌。
Nat Commun. 2021 Feb 3;12(1):763. doi: 10.1038/s41467-021-21012-x.
3
A Novel Method for Alignment-free DNA Sequence Similarity Analysis Based on the Characterization of Complex Networks.
一种基于复杂网络特征的无比对DNA序列相似性分析新方法。
Evol Bioinform Online. 2016 Oct 6;12:229-235. doi: 10.4137/EBO.S40474. eCollection 2016.