• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于分析蛋白质序列的广义迭代映射。

A Generalized Iterative Map for Analysis of Protein Sequences.

机构信息

School of Science, Zhejiang Sci-Tech University, Hangzhou,China.

College of Life Science, Zhejiang Sci-Tech University, Hangzhou,China.

出版信息

Comb Chem High Throughput Screen. 2022;25(3):381-391. doi: 10.2174/1386207323666201012142318.

DOI:10.2174/1386207323666201012142318
PMID:33045963
Abstract

AIM AND OBJECTIVE

The similarities comparison of biological sequences is an important task in bioinformatics. The methods of the similarities comparison for biological sequences are divided into two classes: sequence alignment method and alignment-free method. The graphical representation of biological sequences is a kind of alignment-free method, which constitutes a tool for analyzing and visualizing the biological sequences. In this article, a generalized iterative map of protein sequences was suggested to analyze the similarities of biological sequences.

MATERIALS AND METHODS

Based on the normalized physicochemical indexes of 20 amino acids, each amino acid can be mapped into a point in 5D space. A generalized iterative function system was introduced to outline a generalized iterative map of protein sequences, which can not only reflect various physicochemical properties of amino acids but also incorporate with different compression ratios of the component of a generalized iterative map. Several properties were proved to illustrate the advantage of the generalized iterative map. The mathematical description of the generalized iterative map was suggested to compare the similarities and dissimilarities of protein sequences. Based on this method, similarities/dissimilarities were compared among ND5 protein sequences, as well as ND6 protein sequences of ten different species.

RESULTS

By correlation analysis, the ClustalW results were compared with our similarity/dissimilarity results and other graphical representation results to show the utility of our approach. The comparison results show that our approach has better correlations with ClustalW for all species than other approaches and illustrate the effectiveness of our approach.

CONCLUSION

Two examples show that our method not only has good performances and effects in the similarity/dissimilarity analysis of protein sequences but also does not require complex computation.

摘要

目的和目标

生物序列的相似性比较是生物信息学中的一项重要任务。生物序列相似性比较的方法分为两类:序列比对方法和无比对方法。生物序列的图形表示是一种无比对方法,它构成了分析和可视化生物序列的工具。在本文中,提出了一种蛋白质序列的广义迭代图来分析生物序列的相似性。

材料和方法

基于 20 种氨基酸的归一化物理化学指标,每个氨基酸都可以映射到 5D 空间中的一个点。引入了广义迭代函数系统来描绘蛋白质序列的广义迭代图,它不仅可以反映氨基酸的各种物理化学性质,还可以与广义迭代图的组成部分的不同压缩比相结合。证明了几个性质来说明广义迭代图的优势。提出了广义迭代图的数学描述来比较蛋白质序列的相似性和相异性。基于这种方法,比较了 ND5 蛋白序列和十种不同物种的 ND6 蛋白序列之间的相似性/相异性。

结果

通过相关分析,将 ClustalW 的结果与我们的相似性/相异性结果以及其他图形表示结果进行了比较,以展示我们方法的实用性。比较结果表明,与其他方法相比,我们的方法与所有物种的 ClustalW 相关性更好,说明了我们方法的有效性。

结论

两个示例表明,我们的方法不仅在蛋白质序列的相似性/相异性分析中具有良好的性能和效果,而且不需要复杂的计算。

相似文献

1
A Generalized Iterative Map for Analysis of Protein Sequences.一种用于分析蛋白质序列的广义迭代映射。
Comb Chem High Throughput Screen. 2022;25(3):381-391. doi: 10.2174/1386207323666201012142318.
2
ADLD: a novel graphical representation of protein sequences and its application.ADLD:一种蛋白质序列的新型图形表示及其应用
Comput Math Methods Med. 2014;2014:959753. doi: 10.1155/2014/959753. Epub 2014 Oct 30.
3
Graphical Representation and Similarity Analysis of Protein Sequences Based on Fractal Interpolation.基于分形插值的蛋白质序列图形表示与相似性分析
IEEE/ACM Trans Comput Biol Bioinform. 2017 Jan-Feb;14(1):182-192. doi: 10.1109/TCBB.2015.2511731. Epub 2015 Dec 29.
4
A 3D graphical representation of protein sequences based on the Gray code.基于格雷码的蛋白质序列三维图形表示。
J Theor Biol. 2012 Jul 7;304:81-7. doi: 10.1016/j.jtbi.2012.03.023. Epub 2012 Apr 1.
5
Normalized feature vectors: a novel alignment-free sequence comparison method based on the numbers of adjacent amino acids.标准化特征向量:一种新颖的基于相邻氨基酸数量的无比对序列比较方法。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):457-67. doi: 10.1109/TCBB.2013.10.
6
A protein mapping method based on physicochemical properties and dimension reduction.基于理化性质和降维的蛋白质图谱绘制方法。
Comput Biol Med. 2015 Feb;57:1-7. doi: 10.1016/j.compbiomed.2014.11.012. Epub 2014 Nov 28.
7
Compression-based classification of biological sequences and structures via the Universal Similarity Metric: experimental assessment.通过通用相似性度量对生物序列和结构进行基于压缩的分类:实验评估
BMC Bioinformatics. 2007 Jul 13;8:252. doi: 10.1186/1471-2105-8-252.
8
A novel model for protein sequence similarity analysis based on spectral radius.一种基于谱半径的蛋白质序列相似性分析新模型。
J Theor Biol. 2018 Jun 7;446:61-70. doi: 10.1016/j.jtbi.2018.03.001. Epub 2018 Mar 7.
9
Novel numerical characterization of protein sequences based on individual amino acid and its application.基于单个氨基酸的蛋白质序列新型数值表征及其应用
Biomed Res Int. 2015;2015:909567. doi: 10.1155/2015/909567. Epub 2015 Feb 2.
10
Mapping sequence to feature vector using numerical representation of codons targeted to amino acids for alignment-free sequence analysis.使用针对氨基酸的密码子的数值表示将序列映射到特征向量,用于无比对序列分析。
Gene. 2021 Jan 15;766:145096. doi: 10.1016/j.gene.2020.145096. Epub 2020 Sep 9.