• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用对齐模式簇对蛋白质家族的结合片段进行排序和压缩。

Ranking and compacting binding segments of protein families using aligned pattern clusters.

出版信息

Proteome Sci. 2013 Nov 7;11(Suppl 1):S8. doi: 10.1186/1477-5956-11-S1-S8.

DOI:10.1186/1477-5956-11-S1-S8
PMID:24564874
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3907781/
Abstract

BACKGROUND

Discovering sequence patterns with variation can unveil functions of a protein family that are important for drug discovery. Exploring protein families using existing methods such as multiple sequence alignment is computationally expensive, thus pattern search, called motif finding in Bioinformatics, is used. However, at present, combinatorial algorithms result in large sets of solutions, and probabilistic models require a richer representation of the amino acid associations. To overcome these shortcomings, we present a method for ranking and compacting these solutions in a new representation referred to as Aligned Pattern Clusters (APCs). To tackle the problem of a large solution set, our method reveals a reduced set of candidate solutions without losing any information. To address the problem of representation, our method captures the amino acid associations and conservations of the aligned patterns. Our algorithm renders a set of APCs in which a set of patterns is discovered, pruned, aligned, and synthesized from the input sequences of a protein family.

RESULTS

Our algorithm identifies the binding or other functional segments and their embedded residues which are important drug targets from the cytochrome c and the ubiquitin protein families taken from Unitprot. The results are independently confirmed by pFam's multiple sequence alignment. For cytochrome c protein the number of resulting patterns with variations are reduced by 76.62% from the number of original patterns without variations. Furthermore, all of the top four candidate APCs correspond to the binding segments with one of each of their conserved amino acid as the binding residue. The discovered proximal APCs agree with pFam and PROSITE results. Surprisingly, the distal binding site discovered by our algorithm is not discovered by pFam nor PROSITE, but confirmed by the three-dimensional cytochrome c structure. When applied to the ubiquitin protein family, our results agree with pFam and reveals six of the seven Lysine binding residues as conserved aligned columns with entropy redundancy measure of 1.0.

CONCLUSION

The discovery, ranking, reduction, and representation of a set of patterns is important to avert time-consuming and expensive simulations and experimentations during proteomic study and drug discovery.

摘要

背景

发现具有变异性的序列模式可以揭示对药物发现很重要的蛋白质家族的功能。使用多重序列比对等现有方法探索蛋白质家族在计算上很昂贵,因此使用模式搜索,在生物信息学中称为基序发现。然而,目前,组合算法会产生大量的解决方案,概率模型需要更丰富的氨基酸关联表示。为了克服这些缺点,我们提出了一种在新的表示形式中对这些解决方案进行排序和压缩的方法,称为对齐模式簇 (APC)。为了解决解决方案集过大的问题,我们的方法揭示了一组候选解决方案,而不会丢失任何信息。为了解决表示问题,我们的方法捕获了对齐模式的氨基酸关联和守恒性。我们的算法生成了一组 APC,其中一组模式是从蛋白质家族的输入序列中发现、修剪、对齐和合成的。

结果

我们的算法从 UniProt 中提取的细胞色素 c 和泛素蛋白家族中确定了结合或其他功能片段及其嵌入式残基,这些残基是重要的药物靶点。结果由 pFam 的多重序列比对独立确认。对于细胞色素 c 蛋白,与无变化的原始模式相比,具有变化的结果模式数量减少了 76.62%。此外,前四个候选 APC 都对应于结合片段,其中每个片段的保守氨基酸之一都是结合残基。发现的近端 APC 与 pFam 和 PROSITE 的结果一致。令人惊讶的是,我们的算法发现的远端结合位点没有被 pFam 或 PROSITE 发现,但被三维细胞色素 c 结构所证实。当应用于泛素蛋白家族时,我们的结果与 pFam 一致,并揭示了七个赖氨酸结合残基中的六个作为保守的对齐列,其熵冗余度量为 1.0。

结论

模式的发现、排序、减少和表示对于避免蛋白质组学研究和药物发现过程中的耗时且昂贵的模拟和实验非常重要。

相似文献

1
Ranking and compacting binding segments of protein families using aligned pattern clusters.利用对齐模式簇对蛋白质家族的结合片段进行排序和压缩。
Proteome Sci. 2013 Nov 7;11(Suppl 1):S8. doi: 10.1186/1477-5956-11-S1-S8.
2
Aligning and Clustering Patterns to Reveal the Protein Functionality of Sequences.比对和聚类模式以揭示序列的蛋白质功能
IEEE/ACM Trans Comput Biol Bioinform. 2014 May-Jun;11(3):548-60. doi: 10.1109/TCBB.2014.2306840.
3
Discovering co-occurring patterns and their biological significance in protein families.发现蛋白质家族中的共现模式及其生物学意义。
BMC Bioinformatics. 2014;15 Suppl 12(Suppl 12):S2. doi: 10.1186/1471-2105-15-S12-S2. Epub 2014 Nov 6.
4
Discovery and disentanglement of aligned residue associations from aligned pattern clusters to reveal subgroup characteristics.从比对模式簇中发现并解开比对残基关联以揭示亚组特征。
BMC Med Genomics. 2018 Nov 20;11(Suppl 5):103. doi: 10.1186/s12920-018-0417-z.
5
WildSpan: mining structured motifs from protein sequences.WildSpan:从蛋白质序列中挖掘结构化基序
Algorithms Mol Biol. 2011 Mar 31;6(1):6. doi: 10.1186/1748-7188-6-6.
6
Discovering Protein-DNA Binding Cores by Aligned Pattern Clustering.通过对齐模式聚类发现蛋白质-DNA结合核心
IEEE/ACM Trans Comput Biol Bioinform. 2017 Mar-Apr;14(2):254-263. doi: 10.1109/TCBB.2015.2474376. Epub 2015 Aug 28.
7
Pattern Discovery and Disentanglement for Aligned Pattern Cluster Analysis and Protein Binding Complexes Detection用于对齐模式聚类分析和蛋白质结合复合物检测的模式发现与解缠
8
Prediction of Protein-Protein Interaction via co-occurring Aligned Pattern Clusters.通过共现比对模式簇预测蛋白质-蛋白质相互作用
Methods. 2016 Nov 1;110:26-34. doi: 10.1016/j.ymeth.2016.07.018. Epub 2016 Jul 27.
9
Discovering Patterns From Sequences Using Pattern-Directed Aligned Pattern Clustering.使用模式导向对齐模式聚类从序列中发现模式。
IEEE Trans Nanobioscience. 2018 Jul;17(3):209-218. doi: 10.1109/TNB.2018.2845741. Epub 2018 Jun 8.
10
Revealing Subtle Functional Subgroups in Class A Scavenger Receptors by Pattern Discovery and Disentanglement of Aligned Pattern Clusters.通过模式发现和对齐模式簇的解缠揭示A类清道夫受体中的细微功能亚群
Proteomes. 2018 Feb 8;6(1):10. doi: 10.3390/proteomes6010010.

引用本文的文献

1
Discovery and disentanglement of aligned residue associations from aligned pattern clusters to reveal subgroup characteristics.从比对模式簇中发现并解开比对残基关联以揭示亚组特征。
BMC Med Genomics. 2018 Nov 20;11(Suppl 5):103. doi: 10.1186/s12920-018-0417-z.
2
Revealing Subtle Functional Subgroups in Class A Scavenger Receptors by Pattern Discovery and Disentanglement of Aligned Pattern Clusters.通过模式发现和对齐模式簇的解缠揭示A类清道夫受体中的细微功能亚群
Proteomes. 2018 Feb 8;6(1):10. doi: 10.3390/proteomes6010010.
3
Discovering co-occurring patterns and their biological significance in protein families.

本文引用的文献

1
Epigenetic protein families: a new frontier for drug discovery.表观遗传蛋白家族:药物发现的新前沿。
Nat Rev Drug Discov. 2012 Apr 13;11(5):384-400. doi: 10.1038/nrd3674.
2
Principles of early drug discovery.早期药物发现的原则。
Br J Pharmacol. 2011 Mar;162(6):1239-49. doi: 10.1111/j.1476-5381.2010.01127.x.
3
On position-specific scoring matrix for protein function prediction.蛋白质功能预测的位置特异打分矩阵。
发现蛋白质家族中的共现模式及其生物学意义。
BMC Bioinformatics. 2014;15 Suppl 12(Suppl 12):S2. doi: 10.1186/1471-2105-15-S12-S2. Epub 2014 Nov 6.
IEEE/ACM Trans Comput Biol Bioinform. 2011 Mar-Apr;8(2):308-15. doi: 10.1109/TCBB.2010.93.
4
The Pfam protein families database.Pfam 蛋白质家族数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D211-22. doi: 10.1093/nar/gkp985. Epub 2009 Nov 17.
5
PROSITE, a protein domain database for functional characterization and annotation.PROSITE,一个用于功能特征描述和注释的蛋白质域数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D161-6. doi: 10.1093/nar/gkp885. Epub 2009 Oct 25.
6
Cytochrome c biogenesis: mechanisms for covalent modifications and trafficking of heme and for heme-iron redox control.细胞色素c的生物合成:血红素共价修饰、转运及血红素铁氧化还原调控机制
Microbiol Mol Biol Rev. 2009 Sep;73(3):510-28, Table of Contents. doi: 10.1128/MMBR.00001-09.
7
Targeting the ubiquitin system in cancer therapy.在癌症治疗中靶向泛素系统。
Nature. 2009 Mar 26;458(7237):438-44. doi: 10.1038/nature07960.
8
Atypical ubiquitin chains: new molecular signals. 'Protein Modifications: Beyond the Usual Suspects' review series.非典型泛素链:新的分子信号。“蛋白质修饰:超越常见类型”综述系列。
EMBO Rep. 2008 Jun;9(6):536-42. doi: 10.1038/embor.2008.93.
9
DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment.DIALIGN-TX:基于片段的多序列比对的贪心与渐进方法。
Algorithms Mol Biol. 2008 May 27;3:6. doi: 10.1186/1748-7188-3-6.
10
Characterization of polyubiquitin chain structure by middle-down mass spectrometry.通过中低分辨质谱法对多聚泛素链结构进行表征
Anal Chem. 2008 May 1;80(9):3438-44. doi: 10.1021/ac800016w. Epub 2008 Mar 20.