• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过多序列比对熵谱进行蛋白质同源性检测和折叠推断。

Protein homology detection and fold inference through multiple alignment entropy profiles.

作者信息

Sánchez-Flores Alejandro, Pérez-Rueda Ernesto, Segovia Lorenzo

机构信息

Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México.

出版信息

Proteins. 2008 Jan 1;70(1):248-56. doi: 10.1002/prot.21506.

DOI:10.1002/prot.21506
PMID:17671981
Abstract

Homology detection and protein structure prediction are central themes in bioinformatics. Establishment of relationship between protein sequences or prediction of their structure by sequence comparison methods finds limitations when there is low sequence similarity. Recent works demonstrate that the use of profiles improves homology detection and protein structure prediction. Profiles can be inferred from protein multiple alignments using different approaches. The "Conservatism-of-Conservatism" is an effective profile analysis method to identify structural features between proteins having the same fold but no detectable sequence similarity. The information obtained from protein multiple alignments varies according to the amino acid classification employed to calculate the profile. In this work, we calculated entropy profiles from PSI-BLAST-derived multiple alignments and used different amino acid classifications summarizing almost 500 different attributes. These entropy profiles were converted into pseudocodes which were compared using the FASTA program with an ad-hoc matrix. We tested the performance of our method to identify relationships between proteins with similar fold using a nonredundant subset of sequences having less than 40% of identity. We then compared our results using Coverage Versus Error per query curves, to those obtained by methods like PSI-BLAST, COMPASS and HHSEARCH. Our method, named HIP (Homology Identification with Profiles) presented higher accuracy detecting relationships between proteins with the same fold. The use of different amino acid classifications reflecting a large number of amino acid attributes, improved the recognition of distantly related folds. We propose the use of pseudocodes representing profile information as a fast and powerful tool for homology detection, fold assignment and analysis of evolutionary information enclosed in protein profiles.

摘要

同源性检测和蛋白质结构预测是生物信息学的核心主题。当序列相似性较低时,通过序列比较方法建立蛋白质序列之间的关系或预测其结构会受到限制。最近的研究表明,使用轮廓可以提高同源性检测和蛋白质结构预测的准确性。轮廓可以通过不同的方法从蛋白质多序列比对中推断出来。“保守性之保守性”是一种有效的轮廓分析方法,用于识别具有相同折叠但无可检测序列相似性的蛋白质之间的结构特征。从蛋白质多序列比对中获得的信息会根据用于计算轮廓的氨基酸分类而有所不同。在这项工作中,我们从PSI-BLAST衍生的多序列比对中计算了熵轮廓,并使用了总结近500种不同属性的不同氨基酸分类。这些熵轮廓被转换为伪代码,然后使用FASTA程序和一个临时矩阵进行比较。我们使用同一性低于40%的非冗余序列子集测试了我们的方法识别具有相似折叠的蛋白质之间关系的性能。然后,我们使用每个查询的覆盖率与错误率曲线,将我们的结果与PSI-BLAST、COMPASS和HHSEARCH等方法获得的结果进行比较。我们的方法名为HIP(基于轮廓的同源性识别),在检测具有相同折叠的蛋白质之间的关系时表现出更高的准确性。使用反映大量氨基酸属性的不同氨基酸分类,提高了对远缘相关折叠的识别。我们建议将表示轮廓信息的伪代码用作同源性检测、折叠分配和分析蛋白质轮廓中包含的进化信息的快速而强大的工具。

相似文献

1
Protein homology detection and fold inference through multiple alignment entropy profiles.通过多序列比对熵谱进行蛋白质同源性检测和折叠推断。
Proteins. 2008 Jan 1;70(1):248-56. doi: 10.1002/prot.21506.
2
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
3
Statistical potential-based amino acid similarity matrices for aligning distantly related protein sequences.用于比对远缘相关蛋白质序列的基于统计势的氨基酸相似性矩阵。
Proteins. 2006 Aug 15;64(3):587-600. doi: 10.1002/prot.21020.
4
Beyond the Twilight Zone: automated prediction of structural properties of proteins by recursive neural networks and remote homology information.超越模糊地带:利用递归神经网络和远程同源信息自动预测蛋白质的结构特性
Proteins. 2009 Oct;77(1):181-90. doi: 10.1002/prot.22429.
5
Analysis and prediction of functional sub-types from protein sequence alignments.基于蛋白质序列比对的功能亚类型分析与预测。
J Mol Biol. 2000 Oct 13;303(1):61-76. doi: 10.1006/jmbi.2000.4036.
6
Fast model-based protein homology detection without alignment.基于快速模型的无需比对的蛋白质同源性检测。
Bioinformatics. 2007 Jul 15;23(14):1728-36. doi: 10.1093/bioinformatics/btm247. Epub 2007 May 8.
7
Accuracy of sequence alignment and fold assessment using reduced amino acid alphabets.使用简化氨基酸字母表进行序列比对和折叠评估的准确性。
Proteins. 2006 Jun 1;63(4):986-95. doi: 10.1002/prot.20881.
8
PFRES: protein fold classification by using evolutionary information and predicted secondary structure.PFRES:利用进化信息和预测的二级结构进行蛋白质折叠分类
Bioinformatics. 2007 Nov 1;23(21):2843-50. doi: 10.1093/bioinformatics/btm475. Epub 2007 Oct 17.
9
Exploring the extremes of sequence/structure space with ensemble fold recognition in the program Phyre.在Phyre程序中使用集成折叠识别方法探索序列/结构空间的极限。
Proteins. 2008 Feb 15;70(3):611-25. doi: 10.1002/prot.21688.
10
A Shannon entropy-based filter detects high- quality profile-profile alignments in searches for remote homologues.一种基于香农熵的过滤器在搜索远源同源物时可检测到高质量的序列轮廓比对。
Proteins. 2004 Feb 1;54(2):351-60. doi: 10.1002/prot.10564.

引用本文的文献

1
Functional Characterisation of Bile Metagenome: Study of Metagenomic Dark Matter.胆汁宏基因组的功能表征:宏基因组暗物质研究
Microorganisms. 2021 Oct 21;9(11):2201. doi: 10.3390/microorganisms9112201.
2
Analysis of sequencing strategies and tools for taxonomic annotation: Defining standards for progressive metagenomics.分析分类注释的测序策略和工具:为逐步宏基因组学定义标准。
Sci Rep. 2018 Aug 13;8(1):12034. doi: 10.1038/s41598-018-30515-5.
3
Remote homology and the functions of metagenomic dark matter.远程同源性与宏基因组暗物质的功能。
Front Genet. 2015 Jul 21;6:234. doi: 10.3389/fgene.2015.00234. eCollection 2015.
4
An ensemble method for predicting subnuclear localizations from primary protein structures.一种基于原始蛋白质结构预测亚核定位的集成方法。
PLoS One. 2013;8(2):e57225. doi: 10.1371/journal.pone.0057225. Epub 2013 Feb 27.
5
Parallel Evolution and Horizontal Gene Transfer of the pst Operon in Firmicutes from Oligotrophic Environments.寡营养环境中厚壁菌门pst操纵子的平行进化与水平基因转移
Int J Evol Biol. 2011 Mar 3;2011:781642. doi: 10.4061/2011/781642.
6
Amino acid classification based spectrum kernel fusion for protein subnuclear localization.基于氨基酸分类的光谱核融合的蛋白质亚核定位。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S17. doi: 10.1186/1471-2105-11-S1-S17.